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(54) Title: ANTIBACTERIAL TARGETS IN ALLOIOCOCCUS OTITID1S 



— - (57) Abstract: The present invention relates to the identification of polynucleotide sequences encoding polypeptides of Alloiococcus 
2 otitidis that are essential for the growth and survival of the bacteria. In particular, the invention relates to polypeptides encoded by 
the Alloiococcus otitidis open reading frames (ORFs), and to their use in pharmaceutical compositions, therapeutics, diagnostics 
^ and the like. The present invention also relates to methods for identifying pharmaceutical compounds that inhibit the activity of the 
^ polypeptides that are essential for the growth oiAlloiococcus otitidis. to pharmaceutical compositions containing these compounds 
^ and to their use in treatment and amelioration of diseases caused by Alloiococcus otitidis 
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ANTIBACTERIAL TARGETS IN ALLOIOCOCCUS OT1TIDIS 

Field of the invention 

The present invention relates to the genomic sequence of Alloiococcus otitidis 
5 and polynucleotide sequences encoding polypeptides of the Gram-positive 

bacterium, Alloiococcus otitidis. The invention also relates to polynucleotides and 
polynucleotides encoding polypeptides, preferably antigenic polypeptides, encoded 
by the Alloiococcus otitidis open reading frames and the uses thereof. 

10 Background of the invention 

Since the discovery of penicillin, the use of antibiotics to treat the ravages of 
bacterial infections has saved millions of lives. With the advent of these "miracle 
drugs," for a time it was popularly believed that humanity might, once and for all, be 

15 saved from the scourge of bacterial infections. In fact, during the 1 980s and early 
1990s, many large pharmaceutical companies cut back or eliminated antibiotics 
research and development. They believed that infectious disease caused by bacteria 
finally had been conquered and that markets for new drugs were limited. 
Unfortunately, this belief was overly optimistic. The tide is beginning to turn in favor of 

20 the bacteria, as reports of drug resistant bacteria become more frequent. The United 
States Centers for Disease Control and Prevention announced that one of the most 
powerful known antibiotics, vancomycin, was unable to treat an infection of the 
common bacterial pathogen, Staphylococcus aureus. This organism, commonly 
found in our environment, is responsible for many nosocomial infections. The import 

25 of this announcement becomes clear when one considers that vancomycin was used 
for years to treat infections caused by Staphylococcus species as well as other 
stubborn strains of bacteria. In short, bacteria are becoming resistant to our most 
powerful antibiotics. If this trend continues, it is conceivable that we will return to a 
time when what are presently considered minor bacterial infections are fatal 

30 diseases. 

Over-prescription and improper prescription habits by some physicians have 
caused an indiscriminate increase in the availability of antibiotics to the public. The 
patients are also partly responsible, since they will often improperly use the drug, 
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thereby generating yet another population of bacteria that is resistant, in whole or in 

part, to traditional antibiotics. 

The bacterial pathogens that have haunted humanity remain, in spite of the 

development of modern scientific practices to deal with the diseases that they cause. 
5 Drug resistant bacteria are now an increasing threat to the health of humanity. A new 

generation of antibiotics is needed to once again deal with the pending health threats 

that bacteria present. 

As more and more bacterial strains become resistant to the panel of available 

antibiotics, new antibiotics are required to treat infections. In the past, practitioners of 
10 pharmacology relied upon traditional methods of drug discovery to generate novel, 

safe and efficacious compounds for the treatment of disease. Traditional drug 

discovery methods involve blindly testing potential drug candidate- molecules, often 

selected at random, in the hope that one might prove to be an effective treatment for 

some disease. The process is painstaking and laborious, with no guarantee of 
15 success. 

Newly emerging practices in drug discovery utilize a number of biochemical 
techniques to provide for directed approaches to creating new drugs, rather than 
discovering them at random. For example, gene sequences and proteins encoded 
thereby that are required for the proliferation of a cell or microorganism make 

20 excellent targets since exposure of bacteria to compounds active against these 

targets would result in the inactivation of the cell or microorganism. Once a target is 
identified, biochemical analysis of that target can be used to discover or to design 
molecules that interact with and alter the functions of the target. Use of physical and 
computational techniques to analyze structural and biochemical properties of targets 

25 in order to derive compounds that interact with such targets is called rational drug 
design and offers great potential. Thus, emerging drug discovery practices use 
molecular modeling techniques, combinatorial chemistry approaches, and other 
means to produce and screen and/or design large numbers of candidate compounds. 
Nevertheless, while this approach to drug discovery is clearly the way of the 

30 future, problems remain. For example, the initial step of identifying molecular targets 
for investigation can be an extremely time consuming task. It may also be difficult to 
design molecules that interact with the target by using computer modeling 
techniques. Furthermore, in cases where the function of the target is not known or is 
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poorly understood, it may be difficult to design assays to detect molecules that 
interact with and alter the functions of the target. To improve the rate of novel drug 
discovery and development, methods of identifying important molecular targets in 
pathogenic ceils or microorganisms and methods for identifying molecules that 

5 interact with and alter the functions of such molecular targets are urgently required. 

The present invention is directed to identifying important molecular targets in 
a recently identified bacteria, Alloiococcus otitidis, which has been implicated in otitis 
media with effusion (OME). Otitis media, an inflammatory disease of the middle ear, 
is the most frequent cause of visits to pediatricians' offices in the United States 

10 (Schappert, 1991). Approximately 80% of all children experience at least one episode 
of otitis media by the age of three (Klein, 1994). There are three main types of otitis 
media: Acute otitis media (AOM), otorrhea, and otitis media with effusion (OME). 
Alloiococcus otitidis has only been associated with otitis media with effusion (OME), 
but this may be due to the difficulty of its detection by standard bacterial culturing 

15 methods. Its detection in the effusions is likely due to the fact that the effusions are 
normally sterile and few or no competing bacterial species are isolated from them. 
Without the interference of faster growing nasophryngeal species, the culture plates 
can be incubated for the longer duration needed to detect Alloiococcus otitidis 
colonies. 

20 Three other bacterial species are commonly isolated from middle ear 

effusions. These are nontypeable Haemophilus influenzae, Moraxella catarrhalis, and 
Streptococcus pneumoniae. One or more of these species have been found in one 
study to be associated with about 77% of all cases of OME using a PCR detection 
method (Post, 2000). This study did not include assaying for Alloiococcus otitidis, so 

25 a portion of the unaccounted cases may be due to this organism. 

The bacterium Alloiococcus otitidis was first isolated from the middle ear 
fluids of 10 children in the Buffalo, NY area with persistent OME and characterized as 
a large catalase negative, Gram-positive cocci that tend to occur in clumps, often in 
tetrads. It is slow growing and requires 2 to 5 days at 37°C before colonies can be 

30 seen on sheep blood agar plates. The bacterium was named Alloiococcus otitis by 
Aguirre and Collins (1992), who showed that it was different from other known Gram- 
positive species based on its 16S rRNA sequence. The bacterium's name has been 
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changed from Alloiococcus otitis to Alloiococcus otitidis. (Hendolin, et aL, (1999), and 

Hendolin et aL, (2000)). 

Several studies of the epidemiology Alloiococcus otitidis indicate it is 

associated with otitis media with effusion. These are summarized in Table 1 . These 
5 studies have been done using both culture and PCR techniques. The number of 

cases detected by culture, as might be expected from the fastidious growth 

requirements of the bacterium, was less than the number detected by PCR. 

Assuming that the bacterium is detected more accurately by the PCR method, the 

bacterium is detected in between 10 and 50% of patients with OME. This frequency 
10 suggests that this organism represents a significant public health problem. 

Consequently, there is a need for identifying gene targets in Alloiococcus otitidis for 

the development of anti-infectives. There is also a need for compositions for 

diagnosing Alloiococcus otitidis infection. 

15 

TABLE 1 : SUMMARY OF STUDIES INDICATING AN ASSOCIATION OF ALLOIOCOCCUS 



OTITIDIS WITH OTITIS MEDIA WITH EFFUSION (OME). 



% 

detected 


N a 


Method 


Reference 


8 


200 


Culture 


Faden & Dryja, J. Clin. Microbiol. 27:2488 (1 989) 


3 


100 


Culture 


Sih etal., ICAAC (1992) 


20 


25 


PCR 


Hendolin et aL, J. Clin. Microbiol. 35:2854 (1997) 


50 


12 


PCR 


Beswick, et aL, Lancet 345:386 (1 999) 


42 


67 


PCR 


Hendolin, et aL, Pediatr. Infect. Dis. J. 18:860 (1999) 


10 


49 


PCR 


Hendolin et aL, J. Clin. Microbiol. 38:125 (2000) 



Number of persons in study. 
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SUMMARY OF INVENTION 

The present invention broadly relates to Alloiococcus otitidis genomic 
sequence. Particularly, the invention relates to newly identified polynucleotide open 
5 reading frames (ORFs) comprised within the genomic nucleotide sequence of 
Alloiococcus otitidis, and to polypeptides encoded by the ORFs. More particularly, 
the ORFs encode polypeptides that are essential for the growth and survivablity of 

Alloiococcus otitidis. 

Thus, in certain aspects, the invention relates to Alloiococcus otitidis ORFs 

10 that encode Alloiococcus otitidis polypeptides that function as enzymes in various 
biosynthetic pathways in the bacterium, in one embodiment, the invention relates to 
a purified or isolated Alloiococcus otitidis nucleic acid sequence comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth in Seq. 
ID Nos: 1 to Seq. ID Nos: 105, wherein expression of said nucleic acid is essential for 

15 the proliferation of a cell. In a preferred embodiment the ORF selected from one of 
the odd numbered sequence listings set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105 
encodes an essential gene. The essential gene and the polypeptide encoded by 
them include ACPS (holo-(acyl carrier protein) synthase), murF (UDP-N- 
acetylmuramoylalanyl-D-glutamyl-2,6-diamino pimelate-D-alanyl-D-alanyl iigase) 

20 murA-2 (UDP-N-acetylglucosamine 1 -carboxyvinyitransf erase), RpoE (DNA-directed 
RNA polymerase, delta subunit), rpoA (DNA-directed RNA polymerase alpha 
subunit), rpoC (RNA polymerase beta' subunit), rpoB (DNA-dependent RNA 
polymerase subunit beta), dnaB/C (DNA polymerase III delta prime subunit), gyrA 
(DNA gyrase A subunit), gyrB (DNA gyrase B subunit), dnaN (DNA polymerase III 

25 beta chain, folC-2 (folyl-polyglutamate synthetase), murE (UDP-N-acetylmuramoyl-L- 
alanyl-D-glutamyl-L-lysine Ligase), srtA (sortase), folC-1 (folyl-polyglutamate 
synthetase), folB (dihydroneopterin aldolase), folK (7,8-dihydro-6- 
hydroxymethylpterin-pyrophosphokinase), mvaS (hydroxymethylglutaryl-CoA 
synthase), mvaA (3-hydroxy-3-methylglutaryl-coenzyme a reductase), murB (UDP-N- 

30 acetylglucosaminyl-3-enolpyruvate reductase), mvaK2 (phosphomevalonate kinase), 
mvaD (mevalonate diphosphate decarboxylase), mvaK1 (mevalonate kinase), coaA 
(pantothenate kinase), nadE (NAD+ synthase), murl, Glutamate racemase), folP 
(Dihydropteroate synthase), folA (dihydrofolate reductase), grIB (topoisomerase IV B 
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subunit), gr!A (topoisomerase IV A subunit), rpoD (transcription initiation factor 
sigma), dnaG (DNA primase), era (GTP-binding protein), norA (drug-export protein), 
polC (DNA polymerase III, alpha subunit), obg (GTP-binding protein), yphC (similar 
to Escherichia coli GTP-binding protein Era), dnaE (DNA polymerase III, alpha 

5 subunit), coaBC (phosphopantothenoylcysteine synthetase/decarboxylase), holA 
(DNA polymerase III delta subunit), coaD (phosphopantetheine adenylyltransferase) 
ftsZ (Cell division protein ftsZ), ftsA (Cell division protein ftsA), murG (phospho-N- 
acetylmuramoyl-pentapeptide-transferase), murD (UDP-N-acetylmuramoylalanine D- 
glutamate ligase), nadD (nicotinic acid mononucleotide adenylyltransferase), coaE 

10 (dephospho-CoA kinase), murC (UDP-N-acetyl muramate-alanine ligase), fmhB 
FemX (factor essential for methiciilin resistance), pcrA (ATP-dependent DNA 
helicase), murA-1 (UDP-N-acetylglucosamine 1-carboxyvinyltransferase), holB (DNA 
polymerase III delta 1 subunit) and dnaX (DNA polymerase III -gamma and tau 
subunits). 

15 In another embodiment, the invention relates to purified or isolated nucleic 

acid of Alloiococcus otitidis comprising a fragment of one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, wherein said fragment is 
selected from the group consisting of fragments comprising at least 10, at least 20, at 
least 25, at least 30, at least 50 and more than 50 consecutive nucleotides of one of 

20 one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In yet another embodiment, the invention relates to a purified or isolated 
antisense nucleic acid comprising a nucleotide sequence complementary to at least a 
portion of an intragenic sequence, intergenic sequence, sequences spanning at least 
a portion of two or more genes, 5' noncoding region, or 3' noneoding region within an 

25 operon comprising a proliferation-required gene of Alloiococcus otitidis whose activity 
or expression is inhibited by an antisense nucleic acid and selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In a nother embodiment, the invention relates to a purified or isolated nucleic 
acid comprising a nucleotide sequence having at least 70% identity to a nucleotide 

30 sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
to Seq. ID Nos: 105, fragments comprising at least 25 consecutive nucleotides 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID 
Nos: 105, the nucleotide sequences complementary to one of odd numbered 
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sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, and the sequences 
complementary to fragments comprising at least 25 consecutive nucleotides of one of 
odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In another embodiment, the invention relates to a vector comprising a 

5 promoter operably linked to a nucleic acid encoding a polypeptide whose expression 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence of any 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

In another embodiment, the invention relates to purified or isolated 
polypeptide of Alloiococcus otitidis comprising a polypeptide whose expression is 

10 inhibited by an antisense nucleic acid comprising a nucleotide sequence of one of 
odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a 
fragment selected from the group consisting of fragments comprising at least 5, at 
least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the said polypeptides. 

15 in yet another embodiment, the invention relates to purified or isolated 

Alloiococcus otitidis polypeptide comprising a amino acid sequence having at least 
25% amino acid identity to a polypeptide whose expression is inhibited by a nucleic 
acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or at least 25% amino 

20 acid identity to a fragment comprising at least 1 0, at least 20, at least 30, at least 40, 
at least 50, at least 60 or more than 60 consecutive amino acids of a polypeptide 
whose expression is inhibited by a nucleic acid comprising a nucleotide sequence 
selected from the group consisting of one of odd numbered sequences set forth in 
Seq. ID Nos: 1 to Seq. ID Nos: 105. 

25 In one embodiment, the invention relates to a purified or isolated Alloiococcus 

otitidis polypeptide comprising selected from one of the even numbered sequences 
set forth in Seq. ID Nos: 2 to Seq. ID Nos: 106, wherein the polypeptide is essential 
for the proliferation of a cell.. 

In yet another embodiment, the invention relates to a method of producing an 

30 Alloiococcus otitidis polypeptide comprising introducing into a cell a vector 

comprising a promoter operably linked to a nucleic acid comprising a nucleotide 
sequence encoding a polypeptide whose expression is essential for the proliferation 
and viability of Alloiococcus otitidis, and which is inhibited by an antisense nucleic 
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acid, and which is selected from one of odd numbered sequences set forth in Seq. ID 
Nos: 1 to Seq. ID Nos: 105. 

In yet another embodiment, the invention relates to a method of inhibiting the 
proliferation of Alloiococcus otitidis in an individual comprising inhibiting the activity or 

5 reducing the amount of a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105 or inhibiting the activity or 
reducing the amount of a nucleic acid encoding said gene product 

In a preferred embodiment, the invention relates to method for identifying a 

10 compound which influences the activity of an Alloiococcus otitidis gene product , 
which is required for proliferation, said gene product comprising a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
to Seq. ID Nos: 105, said method comprising: (a) contacting said gene product with a 

15 candidate compound; and (b) determining whether said compound influences the 
activity of said gene product. 

In a preferred embodiment, the invention relates to method for identifying a 
compound or an antisense nucleic acid having the ability to reduce activity or level of 
a Alioiococcus otitidis gene product, which is required for proliferation, said gene 

20 product comprising a gene product whose activity or expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, said method 
comprising the steps of: (a) contacting a target gene or RNA encoding said gene 
product with a candidate compound or antisense nucleic acid; and(b) measuring the 

25 activity of said target. 

In yet another preferred embodiment, the invention relates to method for 
inhibiting cellular proliferation of Alloiococcus otitidis comprising introducing an 
effective amount of a compound with activity against a gene whose activity or 
expression is essential for cellular proliferation, and which is inhibited by an 

30 antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a compound 
with activity against the product of said gene into a population of Alloiococcus otitidis 
cells expressing said gene. 



8- 



WO 03/104391 



PCT/US02/36122 



In a preferred embodiment, the invention relates to a composition comprising 
an effective concentration of an antisense nucleic acid comprising a nucleotide 
sequence selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 
5 to Seq. ID Nos: 1 05, or a proliferation-inhibiting portion thereof in a pharmaceutical^ 
acceptable carrier. 

In a preferred embodiment, the invention relates to method for identifying a 
compound having the ability to inhibit proliferation of Alloiococcus otitidis cell 
comprising: (a) identifying a homologue of a gene or gene product whose activity or 

10 level is inhibited by a nucleic acid comprising a nucleotide sequence selected from 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, in a 
test cell, wherein said test cell is not Alloiococcus otitidis; (b) identifying an inhibitory 
nucleic acid sequence which inhibits the activity of said homologue in said test cell; 
(c) contacting said test cell with a sublethal level of said inhibitory nucleic acid, thus 

15 sensitizing said cell; (d) contacting the sensitized cell of step (c) with a compound; 
and (e) determining the degree to which said compound inhibits proliferation of said 
sensitized cell relative to a cell which does not contain said inhibitory nucleic acid. 

In a preferred embodiment, the invention relates to a method for identifying a 
compound having activity against a biological pathway required for proliferation 

20 comprising: (a) sensitizing a cell by providing a sublethal level of an antisense nucleic 
acid complementary to a nucleic acid encoding a gene product required for 
proliferation, wherein the activity or expression of said gene product is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, in said cell to 

25 reduce the activity or amount of said gene product; (b) contacting the sensitized cell 
with a compound; and (c) determining the degree to which said compound inhibits 
the growth of said sensitized ceil relative to a cell which does not contain said 
antisense nucleic acid. 

In a preferred embodiment, the invention relates to a method for identifying a 

30 compound having the ability to inhibit one of the Alloiococcus otitidis polypeptides 

encoded by a polynucleotide selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, and which is essential for cellular proliferation 
comprising: (a) contacting a cell which expresses the polypeptide with the compound; 
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and (b) determining whether said compound reduces proliferation of said contacted 
cell by acting on said gene product 

In a preferred embodiment, the invention relates to a method for identifying a 
compound having the ability to inhibit one of the purified and isolated Alloiococcus 
5 otitidis polypeptides selected from one of the even numbered sequences set forth in 
Seq. ID No.: 2 to Seq. ID No.: 106, and which is essential for cellular proliferation 
comprising: (a) contacting the purified and isolated polypeptide with the compound in 
vitro in the presence or absence of a substrate, which is essential for the activity of 
the polypeptide; and (b) determining the effect of the compound on the polypeptide 
10 by measuring the effect of the polypeptide on the substrate. 

In a preferred embodiment, the invention relates to a compound which 
interacts with an Alloiococcus otitidis polypeptide selected from one of the even 
numbered sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106 and inhibits its 
activity. 

15 in a preferred embodiment, the invention relates to a method for 

manufacturing an antimicrobial compound comprising the steps of screening one or 
more candidate compounds to identify a compound that reduces the activity or level 
of an Alloiococcus otitidis polypeptide selected from one of the even numbered 
sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106, said polypeptide 

20 comprising a gene product whose activity or expression is inhibited by an antisense 
nucleic acid comprising a nucleotide sequence selected from one of the odd 
numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105; and 
manufacturing the compound so identified. 

In a preferred embodiment, the invention relates to a compound which inhibits 

25 proliferation of Alloiococcus otitidis by interacting with a gene encoding a polypeptide 
that is required for proliferation or with a polypeptide required for proliferation, 
wherein said polypeptide is selected from the group consisting of a gene product 
having at least 70% nucleotide sequence identity from one of the odd numbered 
sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, polypeptide encoded by a 

30 nucleic acid having at least 70% nucleotide sequence identity to a nucleic acid 

encoding a polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from one of the odd numbered 
sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide having at 
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least 25% amino acid identity to a gene product whose expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected one of the odd 
numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide 
encoded by a nucleic acid comprising a nucleotide sequence which hybridizes to a 

5 nucleic acid selected from one of the odd numbered sequences set forth in Seq. ID 
No.: 1 to Seq. ID No. 105 under stringent conditions, a gene product encoded by a 
nucleic acid comprising a nucleotide sequence which hybridizes to a nucleic acid 
selected from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105 under moderate conditions, and a gene product whose activity may be 

10 complemented by the gene product whose activity is inhibited by a nucleic acid 

selected from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105. 



15 



DETAILED DESCRIPTION OF THE INVENTION 



A. Definitions: 

By "biological pathway" is meant any discrete cell function or process that is 
carried out by a gene product or a subset of gene products. Biological pathways 
include anabolic, catabolic, enzymatic, biochemical and metabolic pathways as well 

20 as pathways involved in the production of cellular structures such as cell walls. 
Biological pathways that are usually required for proliferation of cells or 
microorganisms include, but are not limited to, cell division, DNA synthesis and 
replication, RNA synthesis (transcription), protein synthesis (translation), protein 
processing, protein transport, fatty acid biosynthesis, electron transport chains, cell 

25 wall synthesis, cell membrane production, synthesis and maintenance, and the like. 

By "inhibit activity of a gene or gene product" is meant having the ability to 
interfere with the function of a gene or gene product in such a way as to decrease 
expression of the gene, in such a way as to reduce the level or activity of a product of 
the gene or in such a way as to inhibit the interaction of the gene or gene product 

30 with other biological molecules required for its activity. 

Agents which inhibit the activity of a gene include agents that inhibit 
transcription of the gene, agents that inhibit processing of the transcript of the gene, 
agents that reduce the stability of the transcript of the gene, and agents that inhibit 
translation of the mRNA transcribed from the gene. In microorganisms, agents which 
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inhibit the activity of a gene can act to decrease expression of the operon in which 
the gene resides or alter the folding or processing of operon RNA so as to reduce the 
level or activity of the gene product. The gene product can be a non- translated RNA 
such as ribosomal RNA, a translated RNA (mRNA) or the protein product resulting 
5 from translation of the gene mRNA. Of particular utility to the present invention are 
antisense RNAs that have activities against the operons or genes to which they 

specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the 
function or to reduce the level or activity of the gene product in a cell. This includes, 
10 but is not limited to, inhibiting the enzymatic activity of the gene product or the ability 
of the gene product to interact with other biological molecules required for its activity, 
including inhibiting the gene product's assembly into a multimeric structure. 

By "activity against a protein" is meant having the ability to inhibit the function 
or to reduce the level or activity of the protein in a cell. This includes, but is not 
15 limited to, inhibiting the enzymatic activity of the protein or the ability of the protein to 
interact with other biological molecules required for its activity, including inhibiting the 
protein's assembly into a multimeric structure. 

By "activity against a nucleic acid" is meant having the ability to inhibit the 
function or to reduce the level or activity of the nucleic acid in a cell. This includes, 
20 but is not limited to, inhibiting the ability of the nucleic acid interact with other 

biological molecules required for its activity, including inhibiting the nucleic acid's 
assembly into a multimeric structure. 

By "activity against a gene" is meant having the ability to inhibit the function or 
expression of the gene in a cell. This includes, but is not limited to, inhibiting the 
25 ability of the gene to interact with other biological molecules required for its activity. 
By "activity against an operon" is meant having the ability to inhibit the function or 
reduce the level of one or more products of the operon in a cell. This includes, but is 
not limited to, inhibiting the enzymatic activity of one or more products of the operon 
or the ability of one or more products of the operon to interact with other biological 
30 molecules required for its activity. 

By "antibiotic" is meant an agent which inhibits the proliferation of a cell or 

microorganism. 
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By "homologous coding nucleic acid" is meant a nucleic acid homologous to a 
nucleic acid encoding a gene product whose activity or level is inhibited by a nucleic 
acid selected from the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or a 
portion thereof. In some embodiments, the homologous coding nucleic acid may 
5 have at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at least 

* 

70% nucleotide sequence identity to a nucleotide sequence selected from the group 
consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 
10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 
nucleotides thereof. In other embodiments the homologous coding nucleic acids may 
10 have at least 97%, at least 5 95%, at least 90%, at least 85%, at least 80%, or at 
least 70% nucleotide sequence identity to a nucleotide sequence selected from the 
group consisting of the nucleotide sequences complementary to one of Seq ID Nos.: 
1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 
50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof, identity may 
15 be measured using BLASTN version 2.0 with the default parameters or tBLASTX 
with the default parameters. (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A 
New Generation of Protein Database Search Programs, Nucleic Acid Res. 25: 3389- 
3402 (1997)) Alternatively a "homologuous coding nucleic acid" could be identified by 
membership of the gene of interest to a functional orthologue cluster. All other 
20 members of that orthologue cluster would be considered homologues. Such a library 
of functional orthologue clusters can be found at hltp://www.nebi.nlm.nib.gov/COG. A 
gene can be classified into a cluster of orthologous groups or COG by using the 
COGNITOR program available at the above web site, or by direct BLASTP 
comparison of the gene of interest to the members of the COGs and analysis of 
25 these results as described by Tatusov, R.L., Galperin, M.Y., Natale, D. A. and 

Koonin, E.V. (2000) The COG database: a tool for genome- scale analysis of protein 
functions and evolution. Nucleic Acids Research v. 2 8 n. 1 , pp3 3 -3 6. 

The term "homologous coding nucleic acid" also includes nucleic acids 
comprising nucleotide sequences which encode polypeptides having at least 99%, 
30 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at least 
50%, at least 40% or at least 25% amino acid identity or similarity to a polypeptide 
comprising the amino acid sequence of one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or 
to a polypeptide whose expression is inhibited by a nucleic acid comprising a 
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nucleotide sequence of one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 or fragments 
comprising at least 5, 10, 15, 20, 25, 30,35, 40, 50, 75, 100, or 150 consecutive 
amino acids thereof as determined using the FASTA version 3.CK78 algorithm with 
the default parameters. Alternatively, protein identity or similarity may be identified 
5 using BLASTP with the default parameters, BLASTX with the default parameters, 
TBLASTN with the default parameters, or tBLASTX with the default parameters. 
(Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of Protein 
Database Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997)). 

The term "homologous coding nucleic acid" also includes coding nucleic acids 
10 which hybridize under stringent conditions to a nucleic acid selected from the group 
consisting of the nucleotide sequences complementary to one of Seq ID Nos.: 1 to 
Seq. ID Nos.: 105 and coding nucleic acids comprising nucleotide sequences which 
hybridize under stringent conditions to a fragment comprising at least 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the 
15 sequences complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. 

As used herein, "stringent conditions" means hybridization to filter-bound 
nucleic acid in 6xSSC at about 45*C followed by one or more washes in 0. lxSSC/0.2/ 
SDS at about 680C. Other exemplary stringent conditions may refer, e.g., to washing 
in 6xSSC/0.05% sodium pyrophosphate at 37C, 48'C, 55'C, and 60'C as appropriate 

20 for the 5 particular probe being used. 

The term "homologous coding nucleic acid" also includes coding nucleic acids 
comprising nucleotide sequences which hybridize under moderate conditions to a 
nucleotide sequence selected from the group consisting of the sequences 
complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 and coding nucleic 

25 acids comprising nucleotide sequences which hybridize under moderate conditions to 
a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 
150,200,300,400, or 500 consecutive nucleotides of the sequences complementary 
to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. As used herein, "moderate conditions" 
means hybridization to filter-bound DNA in 6x sodium chloride/sodium citrate (SSC) 

30 at about 45'C followed by one or more washes in 0.2xSSC/0. 1 % SDS at about 42- 
65'C. 

The term "homologous coding nucleic acids" also includes nucleic acids 
comprising nucleotide sequences which encode a gene product whose activity may 
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be complemented by a gene encoding a gene product whose activity is inhibited by a 
nucleic acid comprising a nucleotide sequence selected from the group consisting of 
Seq ID Nos.: 1 to Seq. ID Nos.: 105. In some embodiments, the homologous coding 
nucleic acids may encode a gene product whose activity is complemented by the 

5 gene product encoded by a nucleic acid comprising a nucleotide sequence selected 
from the group consisting Seq ID Nos.: 1 to Seq. ID Nos.: 105. In other 
embodiments, the homologous coding nucleic acids may comprise a nucleotide 
sequence encodes a gene product whose activity is complemented by one of the 
polypeptides of Seq ID Nos.: 1 to Seq. ID Nos.: 105 . 

l0 The term "homologous antisense nucleic acid" includes nucleic acids 

comprising a nucleotide sequence having at least 97%, at least 95%, at least 90%, at 
least 85%, at least 80%, or at least 70% nucleotide sequence identity to a nucleotide 
sequence selected from the group consisting of one of the sequences of Seq ID 
Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 

15 30,35,40, 50, 75, 1 00, 1 50, 200,300,400, or 500 consecutive nucleotides thereof. 
Homologous antisense nucleic acids may also comprising nucleotide sequences 
which have at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at 
least 70% nucleotide sequence identity to a nucleotide sequence selected from the 
group consisting of the sequences complementary to one of sequences of Seq ID 

20 Nos.: 1 to Seq. ID Nos.: 105 and fragments comprising at least 10, 15, 20, 25, 30, 35, 
40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides thereof. 

Nucleic acid identity may be determined as described above. 
The term "homologous antisense nucleic acid" also includes antisense nucleic acids 
comprising nucleotide sequences which hybridize under stringent conditions to a 

25 nucleotide sequence complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105 
and antisense nucleic acids comprising nucleotide sequences which hybridize under 
stringent conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 
75, 100, 150,200, 300, 400, or 500 consecutive nucleotides of the sequence 
complementary to one Seq ID Nos.: 1 to Seq. ID Nos.: 105. Homologous antisense 

30 nucleic acids also include antisense nucleic acids comprising nucleotide sequences 
which hybridize under stringent conditions to a nucleotide sequence selected from 
the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105, and antisense nucleic 
acids comprising nucleotide sequences which hybridize under stringent conditions to 
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a fragment comprising at least 10, 15, 20,25, 30, 35, 40, 50, 75, 
100,150,200,300,400, or 500 consecutive nucleotides of one of Seq ID Nos.: 1 to 

Seq. ID Nos.: 105. 

The term "homologous antisense nucleic acid" also includes antisense 

5 nucleic acids comprising nucleotide sequences which hybridize under moderate 

conditions to a nucleotide sequence complementary to one of Seq ID Nos.: 1 to Seq. 
ID Nos.: 105 and antisense nucleic acids comprising nucleotide sequences which 
hybridize under moderate conditions to a fragment comprising at least 10, 15, 20, 25, 
30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of the 

10 sequence complementary to one of Seq ID Nos.: 1 to Seq. ID Nos.: 105. 

Homologous antisense nucleic acids also include antisense nucleic acids comprising 
nucleotide sequences which hybridize under moderate conditions to a nucleotide 
sequence selected from the group consisting of Seq ID Nos.: 1 to Seq. ID Nos.: 105 
and antisense nucleic acids which comprising nucleotide sequences hybridize under 

15 moderate conditions to a fragment comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 
75, 100, 150, 200, 300, 400, or 500 consecutive nucleotides of one of Seq ID Nos.: 1 

to Seq. ID Nos.: 105. 

By "homologous polypeptide" is meant a polypeptide homologous to a 
polypeptide whose activity or level is inhibited by a nucleic acid comprising a 

20 nucleotide sequence selected from the group consisting of Seq ID Nos.: 1 to Seq. ID 
Nos.: 105 by a homologous antisense nucleic acid. The term "homologous 
polypeptide" includes polypeptides having at least 99%, 95%, at least 90%, at least 
85%, at least 80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 
25% amino acid identity or similarity to a polypeptide whose activity or level is 

25 inhibited by a nucleic acid selected from the group consisting of Seq ID Nos.: 1 to 

Seq. ID Nos.: 105 or by a homologous antisense nucleic acid, or polypeptides having 
at least 99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 
60%, at least 50%, at least 40% or at least 25% amino acid identity or similarity to a 
polypeptide to a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 

30 1 00, or 1 50 consecutive amino acids of a polypeptide whose activity or level is 
inhibited by a nucleic acid selected from the group consisting of Seq ID Nos.: 1 to 
Seq. ID Nos.: 105 or by a homologous antisense nucleic acid. Identity or similarity 
may be determined using the FASTA version 3. Ot78 algorithm with the default 
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parameters. Alternatively, protein identity or similarity may be identified using 
BLASTP with the default parameters, BLASTX with the default parameters, or 
TBLASTN with the default parameters. (Altschul, S.R et ai. Gapped BLAST and PSI- 
BLAST: A New Generation of Protein Database Search Programs, Nucleic Acid Res. 

5 25:3389-3402(1997). 

The term homologous polypeptide also includes polypeptides having at least 
99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at least 60%, at 
least 50%, at least 40% or at least 25% amino acid identity or similarity to a 
polypeptide selected from the group consisting of Seq ID Nos.: 2 to Seq. ID Nos.: 

10 106 and polypeptides having at least 99%, 95%, at least 90%, at least 85%, at least 
80%, at least 70%, at least 60%, at least 50%, at least 40% or at least 25% amino 
acid identity or similjarity to a fragment comprising at least 5, 10, 15, 20, 25, 30, 35, 
40, 5 0, 75, 100, or 150 consecutive amino acids of a polypeptide selected from the 
group consisting of Seq ID Nos.: 2 to Seq. ID Nos.: 106. 

15 The invention also includes polynucleotides, preferably DNA molecules, that 

hybridize to one of the nucleic acids of Seq ID Nos.: 2 to Seq. ID Nos.: 106 or the 
complements of any of the preceding nucleic acids. Such hybridization may be under 
stringent or moderate conditions as defined above or under other conditions which 
permit specific hybridization. The nucleic acid molecules of the invention that 

20 hybridize to these DNA sequences include oligodeoxynucleotides ("oligos 0 ) which 
hybridize to the target gene under highly stringent or stringent conditions. In general, 
for oiigos between 14 and 70 nucleotides in length the melting temperature (Tm) is 
calculated using the formula: 

Tm ff) = 81.5 + 16.6(!og[monova!ent cations (molar)] + 0.41 (%G+Q - (500N) 

25 where N is the length of the probe. If the hybridization is carried out in a solution 

containing formamide, the melting temperature may be calculated using the equation: 

Tm('C) = 81 .5 + 1 6.6(log[monovalent cations (niolar)] + 0.4 1 (% G+C) - (0.6 
1) (% formamide) - (SOON) where N is the length of the probe. In general, 
hybridization is carried out at about 20-25 degrees below Tin (for DNA-DNA hybrids) 

30 or about 10-15 degrees below Tin (for RNA-DNA hybrids). 

Other hybridization conditions are apparent to those of skill in the art (see, for 
example, Ausubel, F.M. et al., eds., 1989, Current Protocols in Molecular Biology, 
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Vol. 1, Green Publishing Associates, Inc. and John Wiley & Sons, Inc., New York, at 
pp. 6.3.1-6.3.6 and 2.10.3. 

By "identifying a compound" is meant to screen one or more compounds in a 
collection of compounds such as a combinatorial chemical library or other library of 

5 chemical compounds or to characterize a single compound by testing the compound 
in a given assay and determining whether it exhibits the desired activity. 

By "inducer" is meant an agent or solution which, when placed in contact with 
a cell or microorganism, increases transcription, or inhibitor and/or promoter 
clearance/fidelity, from a desired promoter. 

10 As used herein, "nucleic acid" means DNA, RNA, or modified nucleic acids. 

Thus, the terminology/'the nucleic acid of SEQ ID NO: V or "the nucleic acid 
comprising the nucleotide sequence 0 includes both the DNA sequence of SEQ ID 
NO: X and an RNA sequence in which the thymidines in the DNA sequence have 
been substituted with uridines in the RNA sequence and in which the deoxyribose 

15 backbone 'of the DNA sequence has been substituted with a ribose backbone in the 
RNA sequence. Modified nucleic acids are nucleic acids having nucleotides or 
structures which do not occur in nature, such as nucleic acids in which the 
internucleotide phosphate residues with methylphosphonates, phosphorothioates, 
phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs 

20 such as siloxane bridges, carbonate bridges, thioester bridges, as well as many 

others known in the art may also be used in modified nucleic acids. Modified nucleic 
acids may also comprise, (x-anomeric nucleotide units and modified micleotides such 
as 1 2 dideoxy-d-ribofuranose, 1,2-dideoxy- 1 -phenylribofuranose, and N4, N4- 
ethano-5 -methyl-cytosine are contemplated for use in the present invention. 

25 Modified nucleic acids may also be peptide nucleic acids in which the entire 

deoxyribose-phosphate backbone has been exchanged with a chemically completely 
different, but structurally homologous, polyamide (peptide) backbone containing 2- 
aminoethyl glycogen units. 

As used herein, "sub-lethal" means a concentration of an agent below the 

30 concentration required to inhibit all cell growth. 

A proliferation-required gene or gene family is one where, in the absence or 
substantial reduction of a gene transcript and/or gene product, growth or viability of 
the cell or microorganism is reduced or eliminated. Thus, as used herein, the 
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terminology "proliferation- required" or "required for proliferation" encompasses 
instances where the absence or substantial reduction of a gene transcript and/or 
gene product completely eliminates cell growth as well as instances where the 
absence of a gene transcript and/or gene product merely reduces cell growth. These 

5 proliferation-required genes can be used as potential targets for the generation of 
new antimicrobial agents. To achieve that goal, the present invention also 
encompasses assays for analyzing proliferation- required genes and for identifying 
compounds which interact with the gene and/or gene products of the proliferation- 
required genes. In addition, the present invention contemplates the expression of 

10 genes and the purification of the proteins encoded by the nucleic acid sequences 
identified as required proliferation genes and reported herein. The purified proteins 
can be used to generate reagents and screen small molecule libraries or other 
candidate compound libraries for compounds that can be further developed to yield 
novel antimicrobial compounds. 

15 The invention described herein addresses the need for identifying 

Alloiococcus otitidis proliferation-required gene or gene family that may be used to 
identify compounds, which are effective in preventing or treating most or all of the 
disease caused by Alloiococcus otitidis. The invention further addresses the need for 
methods of diagnosing Alloiococcus otitidis infection using the genes and the 

20 polypeptides identified herein. The inventors have identified novel Alloiococcus 
otitidis open reading frames (Ors), which encode proteins/polypeptides that are 
essential for the growth and proliferation of the bacteria. More particularly, the newly 
identified Ors encode polypeptides that are essential for proliferation of Alloiococcus 
otitidis, and thus serve as potential targets for antimicrobial compounds. Thus, in 

25 certain embodiments, the invention comprises Alloiococcus otitidis Ors encoding 

polypeptides that are essential for cellular proliferation, transcription gene products of 
Alloiococcus otitidis Ors, including, but not limited to mRNA, antisense RNA, 
antisense oligonucleotides, and ribozyme molecules, which can be used to inhibit or 
control growth of the microorganism. The invention relates also to methods of 

30 detecting Alloiococcus otitidis nucleic acids or polypeptides and kits for diagnosing 
Alloiococcus otitidis infection. The invention also relates to pharmaceutical 
compositions, in particular antimicrobial compounds in pharmaceutical compositions, 
for the prevention and/or treatment of bacterial infection, in particular infection 
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caused by or exacerbated by Alloiococcus otitidis. 

B. Alloiococcus otitidis ORF Polynucleotides Encoding Polypeptides 
Essential for Proliferation 

5 

Isolated and purified Alloiococcus otitidis ORF polynucleotides of the present 
invention are contemplated for use in the production of Alloiococcus otitidis 
polypeptides. More specifically, in certain embodiments, the ORFs encode 
Alloiococcus otitidis polypeptides that are essential for cell proliferation. Thus, in one 

10 aspect, the present invention provides isolated and purified polynucleotides (ORFs) 
that encode Alloiococcus otitidis essential for cell proliferation. In particular 
embodiments, a polynucleotide of the present invention is a DNA molecule, wherein 
the DNA may be genomic DNA, plasmid DNA or cDNA. In a preferred embodiment, 
a polynucleotide of the present invention is a recombinant polynucleotide, which 

15 encodes an Alloiococcus otitidis polypeptide comprising an amino acid sequence that 
has at least 25% identity to an amino acid sequence of one of even numbered 
sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106 or a fragment thereof. 
In another embodiment, an isolated and purified ORF polynucleotide comprises a 
nucleotide sequence that has at least 70% identity to one of the ORF polynucleotide 

20 nucleotide sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 105, a 

degenerate variant thereof, or a complement thereof. In yet another embodiment, an 
ORF polynucleotide of one of SEQ ID NO: 1 through SEQ ID NO: 105 is comprised 
in a plasmid vector and expressed in a host cell. In a preferred embodiment, the host 
cell is a prokaryotic host cell. 

25 As used herein, the term "polynucleotide" means a sequence of nucleotides 

connected by phosphodiester linkages. Polynucleotides are presented herein in the 
direction from the 5 1 to the 3' direction. A polynucleotide of the present invention can 
comprise from about 10 to about several hundred thousand base pairs. Preferably, a 
polynucleotide comprises from about 10 to about 3,000 base pairs. Preferred lengths 

30 of particular polynucleotide are set forth hereinafter. 

A polynucleotide of the present invention can be a deoxyribonucleic acid 
(DNA) molecule, a ribonucleic acid (RNA) molecule, or analogs of the DNA or RNA 
generated using nucleotide analogs. The nucleic acid molecule can be single- 
stranded or double-stranded, but preferably is double-stranded DNA. Where a 
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polynucleotide is a DNA molecule, that molecule can be a gene, a cDNA molecule or 
a genomic DNA molecule. Nucleotide bases are indicated herein by a single letter 
code: adenine (A), guanine (G), thymine (T) and cytosine (C). 

"Isolated" means altered "by the hand of man" from the natural state. An 
5 "isolated" composition or substance is one that has been changed or removed from 
its original environment, or both. For example, a polynucleotide or a polypeptide 
naturally present in a living animal is not "isolated," but the same polynucleotide or 
polypeptide separated from the coexisting materials of its natural state is "isolated," 
as the term is employed herein. 

10 Preferably, an "isolated" polynucleotide is free of sequences which naturally 

flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic 
acid) in the genomic DNA of the organism from which the nucleic acid is derived. For 
example, in various embodiments, the isolated Alloiococcus otitidis nucleic acid 
molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0. 5 kb or 0. 1 kb of 

15 nucleotide sequences which naturally flank the nucleic acid molecule in genomic 
DNA of the cell from which the nucleic acid is derived. However, the Alloiococcus 
otitidis nucleic acid molecule can also be fused to heterologous protein encoding or 
regulatory sequences and still be considered isolated. 

ORF polynucleotides of the present invention may also be obtained using 

20 standard cloning and screening techniques from a cDNA library derived from mRNA. 
Polynucleotides of the invention can also be obtained from natural sources such as 
genomic DNA libraries (e.g., an Alloiococcus otitidis library) or can be synthesized 
using well-known and commercially available techniques. As contemplated in the 
present invention, ORF polynucleotides are obtained using Alloiococcus otitidis 

25 chromosomal DNA as the template. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences set forth in the odd numbered sequences listed in ID NO: 1 
through SEQ ID NO: 105 (and fragments thereof) due to degeneracy of the genetic 
code, and thus encode the same Alloiococcus otitidis polypeptides as those encoded 

30 by the amino acid sequences shown in even numbered sequences set forth in SEQ 
ID NO:2 through SEQ ID NO: 106 

Orthologs and allelic variants of the Alloiococcus otitidis polynucleotides are 
readily identified using methods well known in the art. An allelic variant or an 
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orthologue of the polynucleotides comprises a nucleotide sequence that is typically at 
least about 70-75%, more typically at least about 80-85%, and most typically at least 
about 90-95% or more homologous to the nucleotide sequence shown in one of the 
odd numbered sequences set forth in SEQ ID NO:1 through SEQ ID NO: 105, or a 

5 fragment of these nucleotide sequences. Such nucleic acid molecules are readily 
identified as being able to hybridize, preferably under stringent conditions, to the 
nucleotide sequence shown in one of the odd numbered sequences set forth in SEQ 
ID NO:1 through SEQ ID NO: 1 05, or a fragment of these nucleotide sequences. 

Moreover, the polynucleotides of the invention can comprise only a fragment 

10 of the coding region of an Alloiococcus otitidis polynucleotide or gene, such as a 
fragment of one of the odd numbered sequences set forth in SEQ ID NO:1 through 
SEQ ID NO: 105. 

When the ORF polynucleotides of the invention are used for the recombinant 
production of Alloiococcus otitidis polypeptides of the present invention, the 

15 polynucleotide may include the coding sequence for the mature polypeptide, by itself, 
or the coding sequence for the mature polypeptide in reading frame with other coding 
sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- 
or prepro- protein sequence, or other fusion peptide portions. For example, a marker 
sequence which facilitates purification of the fused polypeptide can be linked to the 

20 coding sequence (seeGentz et a/., 1989, incorporated herein by reference). Thus, 
contemplated in the present invention is the preparation of polynucleotides encoding 
fusion polypeptides permitting His-tag purification of expression products. The 
polynucleotide may also contain non-coding 5* and 3* sequences, such as 
transcribed, non-translated sequences, splicing and polyadenylation signals. 

25 Thus, a polynucleotide encoding a polypeptide of the present invention, 

including homologs and orthologs from species other than Alloiococcus otitidis, may 
be obtained by a process which comprises the steps of screening an appropriate 
library under stringent hybridization conditions with a labeled probe having the 
sequence of one of the odd numbered sequences set forth in SEQ ID NO:1 through 

30 SEQ ID NO: 1 05 or a fragment thereof; and isolating full-length cDNA and genomic 
clones containing the polynucleotide sequence. Such hybridization techniques are 
well known to the skilled artisan. The skilled artisan will appreciate that, in many 
cases, an isolated cDNA sequence will be incomplete, in that the region coding for 
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the polypeptide is cut short at the 5" end of the cDNA. This is a consequence of 
reverse transcriptase, an enzyme with inherently low "processivity" (a measure of the 
ability of the enzyme to remain attached to the template during the polymerization 
reaction), failing to complete a DNA copy of the mRNA template during the first- 

5 strand cDNA synthesis. 

Thus, in certain embodiments, the polynucleotide sequence information 
provided by the present invention allows for the preparation of relatively short DNA 
(or RNA) oligonucleotide sequences having the ability to specifically hybridize to 
gene sequences of the selected polynucleotides disclosed herein. The term 
10 "oligonucleotide" as used herein is defined as a molecule comprised of two or more 
deoxyribonucleotides or ribonucleotides, usually more than three (3), and typically • 
more than ten (1 0) and up to one hundred (100) or more (although preferably 
between twenty and thirty). The exact size will depend on many factors, which in 
turn depends on the ultimate function or use of the oligonucleotide. Thus, in 
15 particular embodiments of the invention, nucleic acid probes of an appropriate length 
are prepared based on a consideration of a selected nucleotide sequence, e.g., a 
sequence such as that shown in one of the odd numbered sequences set forth in 
SEQ ID NO:1 through SEQ ID NO: 105. The ability of such nucleic acid probes to 
specifically hybridize to a polynucleotide encoding an AHoiococcus otitidis 
20 polypeptide lends them particular utility in a variety of embodiments. Most 

importantly, the probes can be used in a variety of assays for detecting the presence 
of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. 
These primers are generated in any manner, including chemical synthesis, DNA 
25 replication, reverse transcription, or a combination thereof. The sequence of such 
primers is designed using a polynucleotide of the present invention for use in 
detecting, amplifying or mutating a defined segment of an ORF polynucleotide that 
encodes an AHoiococcus otitidis polypeptide from prokaryotic cells using polymerase 
chain reaction (PCR) technology. 
30 In certain embodiments, it is advantageous to employ a polynucleotide of the 

present invention in combination with an appropriate label for detecting hybrid 
formation. A wide variety of appropriate labels are known in the art, including 
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radioactive, enzymatic or other iigands, such as avidin/biotin, which are capable of 
giving a detectable signal. 

Polynucleotides which are identical or sufficiently identical to a nucleotide 
sequence contained in one of the odd numbered sequences set forth in SEQ ID 

5 NO:1 through SEQ ID NO: 105, or a fragment thereof, may be used as hybridization 
probes for cDNA and genomic DNA or as primers for a nucleic acid amplification 
(PCR) reaction, to isolate full-length cDNAs and genomic clones encoding 
polypeptides of the present invention and to isolate cDNA and genomic clones of 
other genes (including genes encoding homologs and orthologs from species other 

10 than AHoiococcus otitidis) that have a high sequence similarity to polynucleotide 

sequences set forth in one of the odd numbered sequences set forth in SEQ ID NO:1 
through SEQ ID NO: 105, or a fragment thereof. Typically these nucleotide 
sequences are from at least 70% identical to at least about 95% identical to that of 
the reference polynucleotide sequence. The probes or primers will generally 

15 comprise at least 15 nucleotides, preferably, at least 30 nucleotides and may have at 
least 50 nucleotides. Particularly preferred probes will have between 30 and 50 
nucleotides. 

There are several methods available and well known to those skilled in the art 
to obtain full-length cDNAs, or extend short cDNAs, for example those based on the 

20 method of Rapid Amplification of cDNA ends (RACE) (see, Frohman et aL, 1 988). 
Recent modifications of the technique, exemplified by the Marathon™ technology 
[Promega, Madison, Wl], for example, have significantly simplified the search for 
longer cDNAs. In the Marathon™ technology, cDNAs have been prepared from 
mRNA extracted from a chosen tissue and an "adaptor" sequence ligated onto each 

25 end. Nucleic acid amplification (PCR) is then carried out to amplify the "missing" 5' 
end of the cDNA using a combination of gene specific and adaptor specific 
oligonucleotide primers. The PCR reaction is then repeated using "nested" primers, 
that is, primers designed to anneal within the amplified product (typically an adaptor 
specific primer that anneals further 3' in the adaptor sequence and a gene specific 

30 primer that anneals further 5 1 in the known gene sequence). The products of this 
reaction are then analyzed by DNA sequencing and a full-length cDNA constructed 
either by joining the product directly to the existing cDNA to give a complete 
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sequence, or carrying out a separate full-length PCR using the new sequence 
information for the design of the 5' primer. 

To provide certain of the advantages in accordance with the present 
invention, a preferred nucleic acid sequence employed for hybridization studies or 
5 assays includes probe molecules that are complementary to at least a 1 0 to about 70 
nucleotides long stretch of a polynucleotide that encodes an Alloiococcus otitidis 
polypeptide, such as that shown in one of the even numbered sequences set forth in 
SEQ ID NO: 2 through SEQ ID NO: 106. A size of at least 1 0 nucleotides in length 
helps to ensure that the fragment will be of sufficient length to form a duplex 
10 molecule that is both stable and selective. Molecules having complementary 

sequences over stretches greater than 10 bases in length are generally preferred in 
order to increase stability and selectivity of the hybrid, and thereby improve the 
quality and degree of specific hybrid molecules obtained. It is generally preferable to 
design nucleic acid molecules with gene-complementary stretches of 25 to 40 
15 nucleotides, 55 to 70 nucleotides, or even longer where desired. For example, such 
fragments are readily prepared by directly synthesizing the fragment by chemical 
means, by application of nucleic acid reproduction technology, such as the PCR 
technology (U.S. Patent 4,683,202, incorporated herein by reference), or by excising 
selected DNA fragments from recombinant plasmids containing appropriate inserts 
20 and suitable restriction enzyme sites. 

In another aspect, the present invention contemplates an isolated and purified 
polynucleotide comprising a nucleotide sequence that is identical or complementary 
to a segment of at least 10 contiguous bases of one of the odd numbered sequences 
set forth in SEQ ID NO: 1 through SEQ ID NO: 105, wherein the polynucleotide 
25 hybridizes to a polynucleotide that encodes an Alloiococcus otitidis polypeptide. 

Preferably, the isolated and purified polynucleotide comprises a base sequence that 
is identical or complementary to a segment of at least 25 to 70 contiguous bases of 
one of the odd numbered sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 
105. For example, the polynucleotide of the invention can comprise a segment of 
30 bases identical or complementary to from 40 to 55 contiguous bases of the disclosed 
nucleotide sequences. 

Accordingly, a polynucleotide probe molecule of the invention can be used for 
its ability to selectively form duplex molecules with complementary stretches of the 
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gene. Depending on the application envisioned, varying conditions of hybridization 
are employed to achieve varying degrees of selectivity of the probe toward the target 
sequence. For applications requiring a high degree of selectivity, relatively stringent 
conditions are employed to form the hybrids. Of course, for some applications, for 
5 example, where one desires to prepare mutants employing a mutant primer strand 
hybridized to an underlying template or where one seeks to isolate an Alloiococcus 
ofttidis homologous polypeptide coding sequence from other cells, functional 
equivalents, or the like, less stringent hybridization conditions are typically needed to 
allow formation of the heteroduplex (see Table 2). Cross-hybridizing species are 
10 thereby readily identified as positively hybridizing signals with respect to control 

hybridizations. Thus, hybridization conditions are readily manipulated, and thus will 
generally be a method of choice depending on the desired results. 

Of course, for some applications, for example, where one desires to prepare 
mutants employing a mutant primer strand hybridized to an underlying template or 
15 where one seeks to isolate a homologous polypeptide coding sequence from other 
cells, functional equivalents, or the like, less stringent hybridization conditions are 
typically needed to allow formation of the heteroduplex. Cross-hybridizing species 
are thereby readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be 
20 rendered more stringent by the addition of increasing amounts of formamide, which 
serves to destabilize the hybrid duplex in the same manner as increased 
temperature. Thus, hybridization conditions are readily manipulated, and thus are 
generally a method of choice depending on the desired results. 

The present invention also includes polynucleotides capable of hybridizing 
25 under reduced stringency conditions, more preferably stringent conditions, and most 
preferably highly stringent conditions, to polynucleotides described herein. Examples 
of stringency conditions are shown in the table below: highly stringent conditions are 
those that are at least as stringent as, for example, conditions A-F; stringent 
conditions are at least as stringent as, for example, conditions G-L; and reduced 
30 stringency conditions are at least as stringent as, for example, conditions M-R. 
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Table 2 
Stringency Conditions 



Qtrinnonru 

oiriiiytJiiuy 

Condition 


Polynucleotide 
Hybrid 


Hybrid Length 
(bp) 1 


Hybridization 
Temperature and 
Buffer" 


Wash Temperature and 
BufferH 


A 
r\ 


DNA-DNA 


> 50 


65°C; 1xSSC -or- 
42 °C; 1xSSC, 50% 
formamide 


65 °C; 0.3XSSC 


B 


DNAiDNA 


<50 


T B ; 1xSSC 


T B ; 1xSSC 


C 


DNA:RNA 


>50 


67 °C; 1xSSC -or- 
45 °C; 1xSSC, 50% 
formamide 


67 °C; 0.3xSSC 


D 


DNA:RNA 


<50 


T D ; 1xSSC 


T D ; 1xSSC 


E 


RNAiRNA 


>50 


70 °C; 1xSSC -or- 
50 °C; 1xSSC, 50% 
formamide 


70 °C; 0.3XSSC 


F 


RNA:RNA 


<50 


Tf; 1xSSC 


T P ; 1xSSC 


G 


DNA:DNA 


>50 


65°C;4xSSC -or- 
42 °C; 4xSSC,50% 
formamide 


65 °C; 1xSSC 


H 


DNA:DNA 


< 50 


T H ; 4xSSC 


T H ; 4xSSC 


1 


HM A .DM A 




67 °C- 4xSSC -or- 
45°C;4xSSC, 50% 
formamide 


67 °C; "IxSSC 


J 


DNA:RNA 


<50 


Tj; 4xSSC 


Tj; 4xSSC 


K 


RNA:RNA 


> 5U 


formamide 


67 °C' 1xSSC 

• 


L 


RNArRNA 


<50 


T L ; 2xSSC 


T L ; 2xSSC 


M 


DNA:DNA 


>50 


50 °C; 4xSSC -or- 
40°C;6xSSC,50% 
formamide 


50 °C; 2xSSC 


N 


DNA:DNA 


<50 


T N ; 6xSSC 


Tm; 6xSSC 


O 


DNA:RNA 


>50 


55 °C; 4xSSC -or- 
42 °C; 6xSSC, 50% 
formamide 


55°C;2xSSC 


P 


DNArRNA 


<50 


T P ; 6xSSC 


T P ; 6xSSC 


Q 


RNAiRNA 


>50 


60 °C; 4xSSC -or- 
45 °C; 6xSSC, 50% 
formamide 


60°C;2xSSC 


R 


RNAiRNA 


<50 


T R ; 4xSSC 


T r ; 4xSSC 



(bp) 1 : The hybrid length is that anticipated for the hybridized region(s) of the 
hybridizing polynucleotides. When hybridizing a polynucleotide to a target / 
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polynucleotide of unknown sequence, the hybrid length is assumed to be that of the 
hybridizing polynucleotide. When polynucleotides of known sequence are 
hybridized, the hybrid length can be determined by aligning the sequences of the 
polynucleotides and identifying the region or regions of optimal sequence 

5 complementarity. 

Buffer": SSPE (IxSSPE is 0.15M NaCI, 10mM NaH 2 P0 4 , and 1.25mM EDTA, 
pH 7.4), can be substituted for SSC (1xSSC is 0.1 5M NaCI and 15mM sodium 
citrate) in the hybridization and wash buffers; washes are performed for 15 minutes 
after hybridization is complete. 

10 T B through T R : The hybridization temperature for hybrids anticipated to be 

less than 50 base pairs in length should be 5-1 OEC less than the melting temperature 
(T m ) of the hybrid, where T m is determined according to the following equations. For 
hybrids less than 1 8 base pairs in length, T m (EC) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, T m (EC) = 81 .5 + 

15 1 6.6(log 10 [Na + ]) + 0.41 (%G+C) - (600/N), where N is the number of bases in the 

hybrid, and [Na + ] is the concentration of sodium ions in the hybridization buffer ([Na + ] 

for 1xSSC = 0.165 M). 

Additional examples of stringency conditions for polynucleotide hybridization 
are provided in Sambrook et a/., 1989, Molecular Cloning: A Laboratory Manual, Cold 

20 Spring Harbor Laboratory Press, Cold Spring Harbor, NY, chapters 9 and 1 1 , and 
Ausubel et al., 1 995, Current Protocols in Molecular Biology, Eds., John Wiley & 
Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

In addition to the nucleic acid molecules encoding Alfoiococcus otitidis 
polypeptides described above, another aspect of the invention pertains to isolated 

25 nucleic acid molecules that are antisense thereto. An "antisense" nucleic acid 

comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid 

30 can be complementary to an entire Alloiococcus otitidis coding strand, or to only a 
fragment thereof. In one embodiment, an antisense nucleic acid molecule is 
antisense to a "coding region" of the coding strand of a nucleotide sequence 
encoding an Alloiococcus otitidis polypeptide. 
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The term "coding region" refers to the region of the nucleotide sequence 
comprising codons which are translated into amino acid residues, e.g., the entire 
coding region of each of the odd numbered sequences set forth in SEQ ID NO: 1 
through SEQ ID NO: 105. In another embodiment, the antisense nucleic acid 
5 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
sequence encoding an Alloiococcus otitidis polypeptide. The term "noncoding 
region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids {i.e., also referred to as 5' and 3' untranslated regions). 
Given the coding strand sequence encoding the Alloiococcus otiticlis 
10 polypeptides disclosed herein antisense nucleic acids of the invention can be 

designed according to the rules of Watson and Crick base pairing. The antisense 
nucleic acid molecule can be complementary to the entire coding region of 
Alloiococcus otitidis mRNA, but more preferably is an oligonucleotide which is 
antisense to only a fragment of the coding or noncoding region of Alloiococcus otitidis 
15 mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of Alloiococcus otitidis mRNA. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 
35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic ligation reactions using 
20 procedures known in the art. For example, an antisense nucleic acid (e.g., an 

antisense oligonucleotide) can be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives 
25 and acridine substituted nucleotides can be used. Examples of modified nucleotides 
which can be used to generate the antisense nucleic acid include 5-fluorouracil, 5- 
bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetyicytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracii, beta-D-galactosylqueosine, inosine, 
30 N6-isopentenyladenine, l-methylguanine, l-methylinosine, 2,2-dimethylguanine, 2- 

methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracii, 
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2- 

-29- 



WO 03/104391 



PCT/US02/36122 



methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid 
(v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

5 diaminopurine. 

Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been subcloned in an antisense 
orientation {i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the 

10 following subsection). 

The antisense nucleic acid molecules of the invention are typically 
administered to a subject or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding an Alfoiococcus otitidis polypeptide to 
thereby inhibit expression of the polypeptide, e.g., by inhibiting transcription and/or 

15 translation. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. An example of a route of administration of an antisense 
nucleic acid molecule of the invention includes direct injection at a tissue site. 

20 Alternatively, an antisense nucleic acid molecule can be modified to target selected 
cells and then administered systemically. For example, for systemic administration, 
an antisense molecule can be modified such that it specifically binds to a receptor or 
an antigen expressed on a selected cell surface, e.g., by linking the antisense nucleic 
acid molecule to a peptide or an antibody which binds to a cell surface receptor or 

25 antigen. The antisense nucleic acid molecule can also be delivered to cells using the 
vectors described herein. 

In yet another embodiment, the antisense nucleic acid molecule of the 
invention is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid 
molecule forms specific double-stranded hybrids with complementary RNA in which, 

30 contrary to the usual y-units, the strands run parallel to each other (Gaultier et al., 
1987). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide (Inoue et al, 1987) or a chimeric RNA-DNA analogue (Inoue et 
al., 1987). 
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In stili another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 

5 described in Haselhoff and Gerlach, 1 988) can be used to catalytically cleave 

Alloiococcus otitidis mRNA transcripts to thereby inhibit translation of Alloiococcus 
otitidis mRNA. A ribozyme having specificity for an Alloiococcus ott/d/s-encoding 
nucleic acid can be designed based upon the nucleotide sequence of an 
Alloiococcus otitidis cDNA disclosed herein. For example, a derivative of a 

10 Tetrahymena L-1 9 IVS RNA can be constructed in which the nucleotide sequence of 
the active site is complementary to the nucleotide sequence to be cleaved in an 
Alloiococcus otitid/s-encoding mRNA. See, e.g., Cech etal. U.S. 4,987,071 and 
Cech et at. U.S. 5,1 16,742 both incorporated herein in their entirety by reference. 
Alternatively, Alloiococcus otitidis mRNA can be used to select a catalytic RNA 

15 having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., 
Bartel and Szostak, 1993. 

Alternatively Alloiococcus otitidis gene expression can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the 
Alloiococcus otitidis gene (e.g., the Alloiococcus otitidis gene promoter and/or 

20 enhancers) to form triple helical structures that prevent transcription of the 

Alloiococcus otitidis gene in target cells. See generally, Helene, 1 991 ; Helene et a/., 
1 992; and Maher, 1 992. 

Alloiococcus otitidis gene expression can also be inhibited using RNA 
interference (RNAi). This is a technique for post-transcriptional gene silencing 

25 (PTGS), in which target gene activity is specifically abolished with cognate double- 
stranded RNA (dsRNA). RNAi resembles in many aspects PTGS in plants and has 
been detected in many invertebrates including trypanosome, hydra, pianaria, 
nematode and fruit fly (Drosophila melangnostei). It may be involved in the 
modulation of transposable element mobilization and antiviral state formation. RNAi 

30 in mammalian systems is disclosed in WO 00/63364, which is incorporated by 

reference herein in its entirety. Basically, dsRNA of at least about 600 nucleotides, 
homologous to the target is introduced into the cell and a sequence specific reduction 
in gene activity is observed. 
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C. ALLOIOCOCCUS OTITIDtS POLYPEPTIDES 

In particular embodiments, the present invention provides isolated and 
5 purified Alloiococcus otitidis polypeptides. Preferably, an Alloiococcus otitidis 

polypeptide of the invention is a recombinant polypeptide. In certain embodiments, 
an Alloiococcus otitidis polypeptide of the present invention comprises the amino acid 
sequence that has at least 25% identity to the amino acid sequence of one of the 
even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106, a 
10 biological equivalent thereof, or a fragment thereof. 

An Alloiococcus otitidis polypeptide according to the present invention 
encompasses a polypeptide that comprises: 1) the amino acid sequence shown in 
one of the even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 
106) functional and non-functional naturally occurring variants or biological 
15 equivalents of Alloiococcus otitidis polypeptides of the even numbered sequences set 
forth in SEQ ID NO: 2 through SEQ ID NO: 106 and recombinantly produced variants 
or biological equivalents of Alloiococcus otitidis polypeptides set out in SEQ ID NO: 2 
through SEQ ID NO: 106) polypeptides isolated from organisms other than 
Alloiococcus otitidis (orthologs of Alloiococcus otitidis polypeptides.) 

20 A biological equivalent or variant of an Alloiococcus otitidis polypeptide 

according to the present invention encompasses 1) a polypeptide isolated from 
Alloiococcus otitidis] and 2) a polypeptide that contains substantial homology to an 
Alloiococcus otitidis polypeptide. 

Biological equivalents or variants of Alloiococcus otitidis include both 

25 functional and non-functionai Alloiococcus otitidis polypeptides. Functional biological 
equivalents or variants are naturally occurring amino acid sequence variants of an 
Alloiococcus otitidis polypeptide that maintain the ability to elicit an immunological or 
antigenic response in a subject. Functional variants will typically contain only 
conservative substitutions of one or more amino acids in any one of even numbered 

30 sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106 or substitution, 

deletion or insertion of non-critical residues in non-critical regions of the polypeptide. 

The present invention further provides non-/\//o/ococcus otitidis orthologues of 
Alloiococcus otitidis polypeptides. Orthologues of Alloiococcus otitidis polypeptides 
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are polypeptides that are isolated from non-Alloiococcus otitidis organisms and 
possess antigenic capabilities of the Alloiococcus otitidis polypeptide. Orthologues of 
an Alloiococcus otitidis polypeptide can readily be identified as comprising an amino 
acid sequence that is substantially homologous to one of the even numbered 

5 sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. 

Modifications and changes can be made in the structure of a polypeptide of 
the present invention and still obtain a molecule having Alloiococcus otitidis 
antigenicity. For example, certain amino acids can be substituted for other amino 
acids in a sequence without appreciable loss of antigenicity. Because it is the 

10 interactive capacity and nature of a polypeptide that defines that polypeptide's 

biological functional activity, certain amino acid sequence substitutions can be made 
in a polypeptide sequence (or, of course, its underlying DNA coding sequence) and 
nevertheless obtain a polypeptide with like properties. 

In making such changes, the hydropathic index of amino acids can be 

15 considered. The importance of the hydropathic amino acid index in conferring 

interactive biologic function on a polypeptide is generally understood in the art (Kyte 
& Doolittle, 1982). It is known that certain amino acids can be substituted for other 
amino acids having a similar hydropathic index or score and still result in a 
polypeptide with similar biological activity. Each amino acid has been assigned a 

20 hydropathic index on the basis of its hydrophobicity and charge characteristics. 
Those indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine 
(+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); 
threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); 
histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); 

25 lysine (-3.9); and arginine (-4.5). 

It is believed that the relative hydropathic character of the amino acid residue 
determines the secondary and tertiary structure of the resultant polypeptide, which in 
turn defines the interaction of the polypeptide with other molecules, such as 
enzymes, substrates, receptors, antibodies, antigens, and the like. It is known in the 

30 art that an amino acid can be substituted by another amino acid having a similar 
hydropathic index and still obtain a functionally equivalent polypeptide. In such 
changes, the substitution of amino acids whose hydropathic indices are within +/-2 is 
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preferred, those within +/-1 are particularly preferred, and those within +/-0.5 are 
even more particularly preferred. 

Substitution of like amino acids can also be made on the basis of 
hydrophilicity, particularly where the biologically functional equivalent polypeptide or 

5 peptide thereby created is intended for use in immunological embodiments. U.S. Pat. 
No. 4,554,101, incorporated herein by reference, states that the greatest local 
average hydrophilicity of a polypeptide, as governed by the hydrophilicity of its 
adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. with a 
biological property of the polypeptide. 

10 As detailed in U.S. Pat. No. 4,554,101 , the following hydrophilicity values 

have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ±1); glutamate (+3.0 ±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); proline (-0.5 ±1); threonine (-0.4); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 

15 phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent polypeptide. 
In such changes, the substitution of amino acids whose hydrophilicity values are 
within ±2 is preferred, those which are within ±1 are particularly preferred, and those 

20 within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on 
the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions 
which take various of the foregoing characteristics into consideration are well known 

25 to those of skill in the art and include: arginine and lysine; glutamate and aspartate; 
serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine 
(See Table 3, below). The present invention thus contemplates functional or 
biological equivalents of an Afloiococcus otitidis polypeptide as set forth above. 
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TABLE 3: 
AMINO ACID SUBSTITUTIONS 



Original Residue Exemplary Residue 

Substitution 



Ala 


Gly; Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Giu 


Cys 


Ser 


Gin 


Asn 


Giu 


Asp 


Gly 


Ala 


His 


Asn; Gin 


lie 


Leu; Val 


Leu 


lie; Val 


Lys 


Arg 


Met 


Met; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp; Phe 


Val 


He; Leu 



Biological or functional equivalents of a polypeptide are also prepared using 
5 site-specific mutagenesis. Site-specific mutagenesis is a technique useful in the 
preparation of second generation polypeptides, or biologically functional equivalent 
polypeptides or peptides, derived from the sequences thereof, through specific 
mutagenesis of the underlying DNA. As noted above, such changes can be 
desirable where amino acid substitutions are desirable. The technique further 

10 provides a capacity to prepare and test sequence variants, for example, incorporating 
one or more of the foregoing considerations, by introducing one or more nucleotide 
sequence changes into the DNA. Site-specific mutagenesis allows the production of 
mutants through the use of specific oligonucleotide sequences which encode the 
DNA sequence of the desired mutation, as well as a sufficient number of adjacent 

15 nucleotides, to provide a primer sequence of sufficient size and sequence complexity 
to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 
10 residues on both sides of the site of the alteration of the sequence. 

In general, the technique of site-specific mutagenesis is well known in the art. 

20 As will be appreciated, the technique typically employs a phage vector, that can exist 
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in both a single stranded and double stranded form. Typically, site-directed 
mutagenesis in accordance herewith is performed by first obtaining a single-stranded 
vector which includes within its sequence a DNA sequence which encodes all or a 
portion of the Alloiococcus otitidis polypeptide sequence selected. An 

5 oligonucleotide primer bearing the desired mutated sequence is prepared (e.g., 
synthetically). This primer is then annealed to the singled-stranded vector, and 
extended by the use of enzymes such as Escherichia coli polymerase I Klenow 
fragment, in order to complete the synthesis of the mutation-bearing strand. Thus, a 
heteroduplex is formed wherein one strand encodes the original non-mutated 

10 sequence and the second strand bears the desired mutation. This heteroduplex 
vector is then used to transform appropriate cells such as Escherichia coli cells and 
clones are selected which include recombinant vectors bearing the mutation. 
Commercially available kits come with all the reagents necessary, except the 
oligonucleotide primers. 

15 An Alloiococcus otitidis polypeptide or polypeptide antigen of the present 

invention is understood to be any Alloiococcus otitidis polypeptide comprising 
substantial sequence similarity, structural similarity and/or functional similarity to an 
Alloiococcus otitidis polypeptide comprising the amino acid sequence of one of the 
even numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. In 

20 addition, an Alloiococcus otitidis polypeptide or polypeptide antigen of the invention is 
not limited to a particular source. Thus, the invention provides for the general 
detection and isolation of the polypeptides from a variety of sources. 

It is contemplated in the present invention, that an Alloiococcus otitidis 
polypeptide may advantageously be cleaved into fragments for use in further 

25 structural or functional analysis, or in the generation of reagents such as Alloiococcus 
otitidis-related polypeptides and Alloiococcus of/f/d/s-specific antibodies. This can be 
accomplished by treating purified or unpurified Alloiococcus otitidis polypeptides with 
a peptidase such as endoproteinase glu-C (Boehringer, Indianapolis, IN). Treatment 
with CNBr is another method by which peptide fragments may be produced from 

30 natural Alloiococcus otitidis polypeptides. Recombinant techniques also can be used 
to produce specific fragments of an Alloiococcus otitidis polypeptide. 

In addition, the inventors also contemplate that compounds sterically similar 
to a particular Alloiococcus otitidis polypeptide antigen, called peptidomimetics, may 
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be formulated to mimic the key portions of the peptide structure. Peptidemimetics 
are peptide-containing molecules that mimic elements of protein secondary structure. 
(See, for example, Johnson et a/., 1993.) The underlying rationale behind the use of 
peptide mimetics is that the peptide backbone of proteins exists chiefly to orient 
5 amino acid side chains in such a way as to facilitate molecular interactions, such as 
those of receptor and ligand. 

Successful applications of the peptide mimetic concept have thus far focused 
on mimetics of p-turns within proteins. Likely (3-turn structures, within Alloiococcus 
otitidis, can be predicted by computer-based algorithms as discussed above. Once 
10 the component amino acids of the turn are determined, mimetics can be constructed 
to achieve a similar spatial orientation of the essential elements of the amino acid 
side chains, as discussed in Johnson et a/., 1993. 

Fragments of the Alloiococcus otitidis polypeptides are also included in the 
invention. A fragment is a polypeptide having an amino acid sequence that entirely is 
15 the same as a part, but not all, of the amino acid sequence. The fragment can 
comprise, for example, at least 7 or more (e.g., 8, 10 12, 14, 16, 18, 20 or more) 
contiguous amino acids of an one of amino acid sequence selected from one of the 
even numbered sequences set forth in SEQ ID NO.: 2 through SEQ ID NO.: 106. 
Fragments may be "freestanding" or comprised within a larger polypeptide of which 
20 they form a part or region, most preferably as a single, continuous region. In one 
embodiment, the fragments include at least one epitope of the mature polypeptide 
sequence. 

"Fusion protein" refers to a protein encoded by two, often unrelated, fused 
genes or fragments thereof. For example, fusion proteins comprising various 

25 portions of constant region of immunoglobulin molecules together with another 

human protein or part thereof have been described. In many cases, employing an 
immunoglobulin Fc region as a part of a fusion protein is advantageous for use in 
therapy and diagnosis resulting in, for example, improved pharmacokinetic properties 
(see, e.g., EP-A 0232 2621). On the other hand, for some uses it would be desirable 

30 to be able to delete the Fc part after the fusion protein has been expressed, detected 
and purified. 
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D. ALLOIOCOCCUS OTITID1S POLYNUCLEOTIDE AND POLYPEPTIDE VARIANTS 

"Variant" as the term is used herein, is a polynucleotide or polypeptide that 
differs from a reference polynucleotide or polypeptide respectively, but retains 
5 essential properties. A typical variant of a polynucleotide differs in nucleotide 
sequence from another, reference polynucleotide. Changes in the nucleotide 
sequence of the variant may or may not alter the amino acid sequence of a 
polypeptide encoded by the reference polynucleotide. Nucleotide changes may 
result in amino acid substitutions, additions, deletions, fusions and truncations in the 
10 polypeptide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Generally, differences are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, in many 
regions, identical. A variant and reference polypeptide may differ in amino acid 
15 sequence by one or more substitutions, additions and deletions in any combination. 
A substituted or inserted amino acid residue may or may not be one encoded by the 
genetic code. A variant of a polynucleotide or polypeptide may be a naturally 
occurring variant such as an allelic variant, or it may be a variant that is not known to 
occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides 
20 may be made by mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by 
comparing the sequences. In the art, "identity" also means the degree of sequence 
relatedness between polypeptide or polynucleotide sequences, as the case may be, 
25 as determined by the match between strings of such sequences. "Identity" can be 
readily calculated by known methods, including but not limited to those described in 
(Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New 
York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, 
30 Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 
Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 
1991 ; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48: 1073 (1988). 
Preferred methods to determine identity are designed to give the largest match 
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between the sequences tested. Methods to determine identity are codified in publicly 
available computer programs. Preferred computer program methods to determine 
identity between two sequences include, but are not limited to, the GCG program 
package (Devereux, J., et a/ 1984), BLASTP, BLASTN, and FASTA (Altschul, S. R, 
5 et a/., 1990. The BLASTX program is publicly available from NCBI and other sources 
(BLAST Manual, Altschul, S. f et a/., NCBI NLM NIH Bethesda, Md. 20894; Altschul, 
S., et ai, 1 990). The well known Smith-Waterman algorithm may also be used to 

m 

determine identity. 

By way of example, a polynucleotide sequence of the present invention may 

10 be identical to the reference sequence of one of SEQ ID NO:1 through SEQ ID NO: 
105, that is be 100% identical, or it may include up to a certain integer number of 
nucleotide alterations as compared to the reference sequence. Such alterations are 
selected from the group consisting of at least one nucleotide deletion, substitution, 
including transition and transversion, or insertion, and wherein said alterations may 

15 occur at the 5' or 3' terminal positions of the reference nucleotide sequence or 

anywhere between those terminal positions, interspersed either individually among 
the nucleotides in the reference sequence or in one or more contiguous groups within 
the reference sequence. The number of nucleotide alterations is determined by 
multiplying the total number of nucleotides in one of the odd numbered sequences 

20 set forth in SEQ ID NO: 1 through SEQ ID NO: 105 by the numerical percent of the 
respective percent identity (divided by 100) and subtracting that product from said 
total number of nucleotides in one of the odd numbered sequences set forth in SEQ 
ID NO: 1 through SEQ ID NO: 105. 

For example, the alterations in an isolated Alloiococcus otitidis polynucleotide 

25 comprise a polynucleotide sequence that has at least 70% identity to the nucleic acid 
sequence of one of the odd numbered sequences set forth in SEQ ID NO: 1 through 
SEQ ID NO: 105; a degenerate variant thereof or a fragment thereof, wherein the 
polynucleotide sequence may include up to n n nucleic acid alterations over the entire 
polynucleotide region of the nucleic acid sequence of any on of the odd numbered 

30 sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 105, wherein n n is the 
maximum number of alterations and is calculated by the formula: 

n n < x n -(Xn*y), 
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in which x n is the total number of nucleic acids of one of SEQ ID NO:1 through SEQ 
ID NO:105 and y has a value of 0.70, wherein any non-integer product of x n and y is 
rounded down to the nearest integer prior to subtracting such product from x„. Of 
course, y may also have a value of 0.80 for 80%, 0.85 for 85%, 0.90 for 90% 0.95 for 
5 95%, eta 

Similarly, a polypeptide sequence of the present invention may be identical to 
the reference sequence of any one of even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 106, that is 100% identical, or it may include up to a 
certain integer number of amino acid alterations as compared to the reference 

10 sequence such that the percentage identity is less than 100%. Such alterations are 
selected from the group consisting of at least one amino acid deletion, substitution, 
including conservative and non-conservative substitution, or insertion, and wherein 
said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 

15 interspersed either individually among the amino acids in the reference sequence or 
in one or more contiguous groups within the reference sequence. The number of 
amino acid alterations for a given % identity is determined by multiplying the total 
number of amino acids in one of the even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 106 by the numerical percent of the respective percent 

20 identity (divided by 100) and then subtracting that product from said total number of 
amino acids in one of the even numbered sequences set forth in SEQ ID NO: 2 
through SEQ ID NO: 106, or: 

n a <x a -(x a *y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino 
25 acids in one of SEQ ID NO: 2 through SEQ ID NO: 106, and y is, for instance 0.70 for 
70%, 0.80 for 80%, 0.85 for 85% etc., and wherein any non-integer product of 
x.sub.a and y is rounded down to the nearest integer prior to subtracting it from x a . 

E. Vectors, Host Cells and Recombinant Alloiococcus otitidis 
30 Polypeptides 

In a preferred embodiment, the present invention provides expression vectors 
comprising ORF polynucleotides that encode Alloiococcus otitidis polypeptides. 
Preferably, the expression vectors of the present invention comprise ORF 
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polynucleotides that encode Alloiococcus otitidis polypeptides comprising the amino 
acid residue sequence of one of the even numbered sequences set forth in SEQ ID 
NO: 2 through SEQ ID NO: 106. More preferably, the expression vectors of the 
present invention comprise a polynucleotide comprising the nucleotide base 
5 sequence of the odd numbered sequences set forth in SEQ ID NO: 1 through SEQ 
ID NO: 105. Even more preferably, the expression vectors of the invention comprise 
a polynucleotide operatively linked to promoter. Still more preferably, the expression 
vectors of the invention comprise a polynucleotide operatively linked to a prokaryotic 
promoter. Alternatively, the expression vectors of the present invention comprise a 
10 polynucleotide operatively linked to an enhancer-promoter, that is, an eukaryotic 
promoter. The expression vectors further comprise a polyadenylation signal that is 
positioned 3' of the carboxy-teirninal amino acid and within a transcriptional unit of 
the encoded polypeptide. 

Expression of proteins in prokaryotes is most often carried out in Escherichia 
15 coli with vectors containing constitutive or inducible promoters directing the 

expression of either fusion or non-fusion proteins. Fusion vectors add a number of 
amino acids to a protein encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve three purposes: 1) to 
increase expression of recombinant protein; 2) to increase the solubility of the 
20 recombinant protein; and 3) to aid in the purification of the recombinant protein by 
acting as a ligand in affinity purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion 
moiety subsequent to purification of the fusion protein. Such enzymes, and their 
25 cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 

. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; 
Smith and Johnson, 1988), pMAL (New England Biolabs, Beverly; MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S- transferase (GST), maltose E 
binding protein, or protein A, respectively, to the target recombinant protein. 
30 In one embodiment, the coding sequence of the Alloiococcus otitidis 

polynucleotide is cloned into a pGEX expression vector to create a vector encoding a 
fusion protein comprising, from the N-terminus to the C-terminus, GST-thrombin 
cleavage s\te-Alloiococcus otitidis polypeptide. The fusion protein can be purified by 
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affinity chromatography using glutathione-agarose resin. Recombinant Alloiococcus 
otitidis polypeptide unfused to GST can be recovered by cleavage of the fusion 
protein with thrombin. 

Examples of suitable inducible non-fusion Escherichia coli expression vectors 
5 include pTrc (Amann et aL, 1 988) and pET I I d (Studier et a/., 1 990). Target gene 
expression from the pTrc vector relies on host RNA polymerase transcription from a 
hybrid trp-lac fusion promoter. Target gene expression from the pET I I d vector 
relies on transcription from a T7 gn1 0-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase T7 gnl. This viral polymerase is supplied by host 
10 strains BL21 (DE3) or HMS I 74(DE3) from a resident prophage harboring a T7 gn1 
gene under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in Escherichia coli 
is to express the protein in a host bacterium with an impaired capacity to 
proteolytically cleave the recombinant protein. Another strategy is to alter the nucleic 
15 acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in Escherichia 
coli. Such alteration of nucleic acid sequences of the invention can be carried out by 
standard DNA mutagenesis or synthesis techniques. 

In another embodiment, the Alloiococcus otitidis polynucleotide expression 
20 vector is a yeast expression vector. Examples of vectors for expression in a yeast 
such as S. cerevisiae include pYepSec I (Baldari, et a/., 1987), pMFa (Kurjan and 
Herskowitz, 1982), pJRY88 (Schultz et a/., 1987), and pYES2 (Invitrogen 
Corporation, San Diego, CA). 

Alternatively, an Alloiococcus otitidis polynucleotide is expressed in insect 
25 cells using, for example, baculovirus expression vectors. Baculovirus vectors 

available for expression of proteins in cultured insect cells (e.g., Sf 9 or Sf 21 cells) 
include the pAc series (Smith et a/., 1983) and the pVL series (Lucklow and 
Summers, 1989). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
30 mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1 987) and pMT2PC (Kaufman et ai t 
1987). When used in mammalian cells, the expression vector's control functions are 
often provided by viral regulatory elements. 
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As used herein, a promoter is a region of a DNA molecule typically within 
about 100 nucleotide pairs in front of (upstream of) the point at which transcription 
begins (Le., a transcription start site). That region typically contains several types of 
DNA sequence elements that are located in similar relative positions in different 

5 genes. As used herein, the term "promoter" includes what is referred to in the art as 
an upstream promoter region, a promoter region or a promoter of a generalized 
eukaryotic RNA Polymerase II transcription unit. 

Another type of discrete transcription regulatory sequence element is an 
enhancer. An enhancer provides specificity of time, location and expression level for 

10 a particular encoding region (e.g., gene). A major function of an enhancer is to 

increase the level of transcription of a coding sequence in a cell that contains one or 
more transcription factors that bind to that enhancer. Unlike a promoter, an enhancer 
can function when located at variable distances from transcription start sites so long 
as a promoter is present. 

15 As used herein, the phrase "enhancer-promoter" means a composite unit that 

contains both enhancer and promoter elements. An enhancer-promoter is 
operatively linked to a coding sequence that encodes at least one gene product. As 
used herein, the phrase "operatively linked" means that an enhancer-promoter is 
connected to a coding sequence in such a way that the transcription of that coding 

20 sequence is controlled and regulated by that enhancer-promoter. Means for 

operatively linking an enhancer-promoter to a coding sequence are well known in the 
art. As is also well known in the art, the precise orientation and location relative to a 
coding sequence whose transcription is controlled, is dependent inter alia upon the 
specific nature of the enhancer-promoter. Thus, a TATA box minimal promoter is 

25 typically located from about 25 to about 30 base pairs upstream of a transcription 
initiation site and an upstream promoter element is typically located from about 100 
to about 200 base pairs upstream of a transcription initiation site. In contrast, an 
enhancer can be located downstream from the initiation site and can be at a 
considerable distance from that site. 

30 An enhancer-promoter used in a vector construct of the present invention can 

b e an y enhancer-promoter that drives expression in a cell to be transfected. By 
employing an enhancer-promoter with well-known properties, the level and pattern of 
gene product expression can be optimized. 
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For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus (CMV) and Simian Virus 40 (SV40). For other 
suitable expression systems for both prokaryotic and eukaryotic cells see chapters 
16 and 17 of Sambrook et a/., "Molecular Cloning: A Laboratory Manual" 2nd, ed, 
5 Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989, incorporated herein by reference. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell 
type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). 
10 Tissue- specific regulatory elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert 
eta/., 1987), lymphoid-specific promoters (Calame and Eaton, 1988), in particular 
promoters of T cell receptors (Winoto and Baltimore, 1 989) and immunoglobulins 
(Banerji et ai, 1983), Queen and Baltimore (1983), neuron-specific promoters (e.g., 
15 the neurofilament promoter; Byrne and Ruddle, 1 989), pancreas-specific promoters 
(Edlund et al. t 1985), and mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. 4, 873,316 and EP 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss, 
1990) and the a-fetoprotein promoter (Campes and Tilghman, 1989). 
20 The invention further provides a recombinant expression vector comprising a 

DNA molecule encoding an AHoiococcus otitidis polypeptide cloned into the 
expression vector in an antisense orientation. That is, the DNA molecule is 
operatively linked to a regulatory sequence in a manner which allows for expression 
(by transcription of the DNA molecule) of an RNA molecule which is antisense to 
25 AHoiococcus otitidis mRNA. Regulatory sequences operatively linked to a nucleic 
acid cloned in the antisense orientation can be chosen which direct the continuous 
expression of the antisense RNA molecule in a variety of cell types, for instance viral 
promoters and/or enhancers, or regulatory sequences can be chosen which direct 
constitutive, tissue specific or cell type specific expression of antisense RNA. The 
30 antisense expression vector can be in the form of a recombinant plasmid, phagemid 
or attenuated virus in which antisense nucleic acids are produced under the control 
of a high efficiency regulatory region, the activity of which can be determined by the 
cell type into which the vector is introduced. 



-44- 



WO 03/104391 



PCTAJS02/36122 



Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention has been introduced. The terms 
"host cell" and "recombinant host cell" are used interchangeably herein. It is 
understood that such terms refer not only to the particular subject cell but also to the 

5 progeny or potential progeny of such a cell. Because certain modifications may 
occur in succeeding generations due to either mutation or environmental influences, 
such progeny may not, in fact, be identical to the parent cell, but are still included 
within the scope of the term as used herein. A host cell can be any prokaryotic or 
eukaryotic cell. For example, an Alloiococcus otitidis polypeptide can be expressed 

10 in bacterial cells such as Escherichia coli, insect cells, yeast or mammalian cells 

(such as Chinese hamster ovary cells (CHO), NIH3T3, PER C6, NSO, VERO or COS 
cells). Other suitable host cells are known to those skilled in the art. 

Vector DNA is can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation, infection or transfection techniques. As used herein, the 

15 terms "transformation" and "transfection" are intended to refer to a variety of art- 
recognized techniques for introducing foreign nucleic acid {e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran- 
mediated transfection, lipofection, protoplast fusion, direct microinfection. Another 
recognized technique for introducing DNA into a host cell is "infection", such as by 

20 adenovirus infection or electroporation. Suitable methods for transforming, infecting 
or transfecting host cells can be found in Sambrook, et ai ("Molecular Cloning: A 
Laboratory Manual" 2nd ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 
The most widely used method is transfection mediated by either calcium 

25 phosphate or DEAE-dextran. Although the mechanism remains unclear, it is 

believed that the transfected DNA enters the cytoplasm of the cell by endocytosis 
and is transported to the nucleus. Depending on the cell type, up to 90% of a 
population of cultured cells can be transfected at any one time. Because of its high 
efficiency, transfection mediated by calcium phosphate or DEAE-dextran is the 

30 method of choice for experiments that require transient expression of the foreign 
DNA in large numbers of cells. Calcium phosphate-mediated transfection is also 
used to establish cell lines that integrate copies of the foreign DNA, which are usually 
arranged in head-to-tail tandem arrays into the host cell genome. 
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In the protoplast fusion method, protoplasts derived from bacteria carrying 
high numbers of copies of plasmid of interest are mixed directly with cultured 
mammalian cells. After fusion of the cell membranes (usually with polyethylene 
glycol), the contents of the bacteria are delivered into the cytoplasm of the 

5 mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast 
fusion is not as efficient as transfection for many of the cell lines that are commonly 
used for transient expression assays, but it is useful for cell lines in which 
endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple 
copies of the plasmid DNA tandemly integrated into the host chromosome. 

10 The application of brief, high-voltage electric pulses (eiectroporation) to a 

variety of mammalian and plant cells leads to the formation of nanometer-sized pores 
in the plasma membrane. DNA is taken directly into the cell cytoplasm either through 
these pores or as a consequence of the redistribution of membrane components that 
accompanies closure of the pores. Eiectroporation can be extremely efficient and 

15 can be used both for transient expression of cloned genes and for establishment of 
cell lines that carry integrated copies of the gene of interest. Eiectroporation, in 
contrast to calcium phosphate-mediated transfection and protoplast fusion, frequently 
gives rise to cell lines that carry one, or at most a few, integrated copies of the 
foreign DNA. 

20 Liposome transfection involves encapsulation of DNA and RNA within 

liposomes, followed by fusion of the liposomes with the cell membrane. The 
mechanism of how DNA is delivered into the cell is unclear, but transfection 
efficiencies can be as high as 90%. 

Direct microinjection of a DNA molecule into nuclei has the advantage of not 
25 exposing DNA to cellular compartments such as low-pH endosomes. Microinjection 
therefore used primarily as a method to establish lines of cells that carry integrated 
copies of the DNA of interest. 

The use of adenovirus as a vector for cell transfection is well known in the art. 
Adenovirus vector-mediated cell transfection has been reported for various cells 
30 (Stratford-Perricaudet, et al. 1 992). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, is used to produce (i.e., express) an Alloiococcus otitidis polypeptide. 
Accordingly, the invention further provides methods for producing an Alloiococcus 
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otiticlis polypeptide using the host cells of the invention. In one embodiment, the 
method comprises culturing the host cell of invention (into which a recombinant 
expression vector encoding an Alloiococcus otitidis polypeptide has been introduced) 
in a suitable medium until the Alloiococcus otitidis polypeptide is produced. In 

5 another embodiment, the method further comprises isolating the Alloiococcus otitidis 
polypeptide from the medium or the host cell. 

A coding sequence of an expression vector is operatively linked to a 
transcription-terminating region. RNA polymerase transcribes an encoding DNA 
sequence through a site where polyadenylation occurs. Typically, DNA sequences 

10 located a few hundred base pairs downstream of the polyadenylation site serve to 
terminate transcription. Those DNA sequences are referred to herein as 
transcription-termination regions. Those regions are required for efficient 
polyadenylation of transcribed messenger RNA (mRNA). Transcription-terminating 
regions are well known in the art. A preferred transcription-terminating region used in 

15 an adenovirus vector construct of the present invention comprises a polyadenylation 
signal of SV40 or the protamine gene. 

An expression vector comprises a polynucleotide that encodes an 
Alloiococcus otitidis polypeptide. Such a polypeptide is meant to include a sequence 
of nucleotide bases encoding an Alloiococcus otitidis polypeptide sufficient in length 

20 to distinguish the segment from a polynucleotide segment encoding a non- 

Alloiococcus otitidis polypeptide. A polypeptide of the invention can also encode 
biologically functional polypeptides or peptides which have variant amino acid 
sequences, such as with changes selected based on considerations such as the 
relative hydropathic score of the amino acids being exchanged. These variant 

25 sequences are those isolated from natural sources or induced in the sequences 
disclosed herein using a mutagenic procedure such as site-directed mutagenesis. 

Preferably, an expression vector of the present invention comprises a 
polynucleotide that encodes a polypeptide comprising the amino acid residue 
sequence of one of the even numbered sequences set forth in SEQ ID NO: 2 through 

30 SEQ ID NO:.4036 An expression vector can include an Alloiococcus otitidis 

polypeptide coding region itself of any of the Alloiococcus otitidis polypeptides noted 
above or it can contain coding regions bearing selected alterations or modifications in 
the basic coding region of such an Alloiococcus otitidis polypeptide. Alternatively, 
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such vectors or fragments can also encode larger polypeptides or polypeptides which 
nevertheless include the basic coding region. In any event, it should be appreciated 
that due to codon redundancy as well as biological functional equivalence, this 
aspect of the invention is not limited to the particular DNA molecules corresponding 

5 to the polypeptide sequences noted above. 

Exemplary vectors include the mammalian expression vectors of the pCMV 
family including pCMV6b and pCMV6c (Chiron Corp., Emeryville CA.). In certain 
cases, and specifically in the case of these individual mammalian expression vectors, 
the resulting constructs can require co-transfection with a vector containing a 

10 selectable marker such as pSV2neo. Via co-transfection into a dihydrofolate 
reductase-deficient Chinese hamster ovary cell line, such as DG44, clones 
expressing Alloiococcus otitidis polypeptides by virtue of DNA incorporated into such 
expression vectors can be detected. 

A DNA molecule of the present invention can be incorporated into a vector by 

15 a number of techniques that are well known in the art. For instance, the vector 

pUC18 has been demonstrated to be of particular value in cloning and expression of 
genes. Likewise, the related vectors M13mp18 and M13mp19 can also be used in 
certain embodiments of the invention, in particular, in performing dideoxy 
sequencing. 

20 An expression vector of the present invention is useful both as a means for 

preparing quantities of the Alloiococcus otitidis polypeptide-encoding DNA itself, and 
as a means for preparing the encoded polypeptide and peptides. It is contemplated 
that where Alloiococcus otitidis polypeptides of the invention are made by 
recombinant means, one can employ either prokaryotic or eukaryotic expression 

25 vectors as shuttle systems. In another aspect, the recombinant host cells of the 

present invention are prokaryotic host cells. Preferably, the recombinant host cells of 
the invention are bacterial cells of the DH5oc strain of Escherichia coli. In general, 
prokaryotes are preferred for the initial cloning of DNA sequences and constructing 
the vectors useful in the invention. For example, Escherichia coli K1 2 strains can be 

30 particularly useful. Other microbial strains that can be used include Escherichia coli 
B, Escherichia co//W3110 (ATCC No. 273325) and Escherichia, coltf 976 (ATCC 
No. 31537). Bacilli such as Bacillus subtilis, or other enterobacteriaceae such as 
Salmonella typhimurium or other Salmonella species or Serratia marcesans, and 
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various pseudomonas species can be used. These examples are, of course, 
intended to be illustrative rather than limiting. 

In general, plasmid vectors containing replicon and control sequences that 
are derived from species compatible with the host cell are used in connection with 

5 these hosts. The vector ordinarily carries a replication site, as well as marking 

sequences that are capable of providing phenotypic selection in transformed cells. 
For example, Escherichia coli can be transformed using pBR322, a plasmid derived 
from an Escherichia coli species (Bolivar, et al. 1 977). pBR322 contains genes for 
ampicillin and tetracycline resistance and thus provides easy means for identifying 

10 transformed cells. The pBR322 plasmid, or other microbial plasmid or phage, must 
also contain, or be modified to contain, promoters which can be used by the microbial 
organism for expression of its own polypeptides. 

Those promoters most commonly used in recombinant DNA construction 
include the p-lactamase (penicillinase) and lactose promoter systems (Chang, et al. 

15 1978; Itakura., et al. 1977, Goeddel, et al. 1979; Goeddel, etal. 1980) and a 

tryptophan (TRP) promoter system (EP 0036776; Siebwenlist etal. 1980). While 
these are the most commonly used, other microbial promoters have been discovered 
and utilized, and details concerning their nucleotide sequences have been published, 
enabling a skilled worker to introduce functional promoters into plasmid vectors 

20 (Siebwenlist, et al. 1 980) . 

In addition to prokaryotes, eukaryotic microbes such as yeast can also be 
used. Saccharomyces cerevisiase or common baker's yeast is the most commonly 
used among eukaryotic microorganisms, although a number of other strains are 
commonly available. For expression in Saccharomyces, the plasmid YRp7, for 

25 example, is commonly used (Stinchcomb, et al. 1 979; Kingsman, et al. 1 979; 

Tschemper, etal. 1980). This plasmid already contains the trpl gene that provides a 
selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, 
for example ATCC No. 44076 or PEP4-1 (Jones, 1977). The presence of the trpl 
lesion as a characteristic of the yeast host cell genome then provides an effective 

30 environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 3- 
phosphoglycerate kinase (PGK) (Hitzeman, etal. 1980) or other glycolytic enzymes 
(Hess, etal. 1968; Holland, etal. 1978) such as enolase, glyceraldehyde-3- 
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phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphogiucose isomerase, and 
glucokinase. In constructing suitable expression plasmids, the termination sequences 

5 associated with these genes are also introduced into the expression vector 

downstream from the sequences to be expressed to provide polyadenylation of the 
mRNA and termination. Other promoters, which have the additional advantage of 
transcription controlled by growth conditions are the promoter region for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes 

10 associated with nitrogen metabolism, and the aforementioned glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose 
utilization. Any plasmid vector containing a yeast-compatible promoter, origin of 
replication, and termination sequences is suitable. 

In addition to microorganisms, cultures of cells derived from multicellular 

15 organisms can also be used as hosts. In principle, any such cell culture is workable, 
whether from vertebrate or invertebrate culture. However, interest has been greatest 
in vertebrate cells, and propagation of vertebrate cells in culture (tissue culture) has 
become a routine procedure in recent years. Examples of such useful host cell lines 
are AtT-20, VERO, HeLa, NSO, PER C6, Chinese hamster ovary (CHO) cell lines, 

20 W138, BHK, COSM6, COS-7, 293 , VERO and MDCK cell lines. Expression vectors 
for such cells ordinarily include (if necessary) an origin of replication, a promoter 
located upstream of the gene to be expressed, along with any necessary ribosome 
binding sites, RNA splice sites, polyadenylation site, and transcriptional terminator 
sequences. 

25 Where expression of recombinant Alloiococcus otitidis polypeptides is desired 

and a eukaryotic host is contemplated, it is most desirable to employ a vector, such 
as a plasmid, that incorporates a eukaryotic origin of replication. Additionally, for the 
purposes of expression in eukaryotic systems, one desires to position the 
Alloiococcus otitidis encoding sequence adjacent to and under the control of an 

30 effective eukaryotic promoter such as promoters used in combination with Chinese 
hamster ovary cells (CHO). To bring a coding sequence under control of a promoter, 
whether it is eukaryotic or prokaryotic, what is generally needed is to position the 5' 
end of the translation initiation side of the proper translational reading frame of the 
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polypeptide between about 1 and about 50 nucleotides 3' of or downstream with 
respect to the promoter chosen. Furthermore, where eukaryotic expression is 
anticipated, one would typically desire to incorporate an appropriate polyadenylation 
site into the transcriptional unit that includes the Alloiococcus otitidis polypeptide. 
5 A transf ected cell can be prokaryotic or eukaryotic. Preferably, the host cells 

of the invention are prokaryotic host cells. Where it is of interest to produce an 
Alloiococcus otitidis polypeptide, cultured prokaryotic host cells are of particular 
interest. 

In yet another embodiment, the present invention contemplates a process or 

10 method of preparing Alloiococcus otitidis polypeptides comprising transf ecting, 
- transforming or infecting cells with a polynucleotide that encodes an Alloiococcus 
otitidis polypeptide to produce transformed host cells; and maintaining the 
transformed host cells under biological conditions sufficient for expression of the 
polypeptide. Preferably, the transformed host cells are prokaryotic cells. 

15 Alternatively, the host cells are eukaryotic cells. More preferably, the prokaryotic 
cells are bacterial cells of the DH5a strain of Escherichia coll Even more preferably, 
the polynucleotide transf ected into the transformed cells comprises the nucleic acid 
sequence of one of the odd numbered sequences set forth in SEQ ID NO: 1 through 
SEQ ID NO: 105. Additionally, transfection is accomplished using an expression 

20 vector disclosed above. A host cell used in the process is capable of expressing a 
functional, recombinant Alloiococcus otitidis polypeptide. 

Following transfection, the cell is maintained under culture conditions for a 
period of time sufficient for expression of an Alloiococcus otitidis polypeptide. Culture 
conditions are well known in the art and include ionic composition and concentration, 

25 temperature, pH and the like. Typically, transfected cells are maintained under 

culture conditions in a culture medium. Suitable media for various cell types are well 
known in the art. in a preferred embodiment, temperature is from about 20°C to 
about 50°C, more preferably from about 30°C to about 40°C and, even more 

preferably about 37°C. 
30 The pH is preferably from about a value of 6.0 to a value of about 8.0, more 

preferably from about a value of about 6.8 to a value of about 7.8 and, most 
preferably about 7.4. Osmolality is preferably from about 200 milliosmols per liter 
(mosm/L) to about 400 mosm/l and, more preferably from about 290 mosm/L to 
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about 31 0 mosm/L. Other biological conditions needed for transfection and 

expression of an encoded protein are well known in the art. 

Transfected cells are maintained for a period of time sufficient for expression 

of an Alloiococcus otitidis polypeptide. A suitable time depends inter alia upon the 
5 cell type used and is readily determinable by a skilled artisan. Typically, 

maintenance time is from about 2 to about 14 days. 

Recombinant Alloiococcus otitidis polypeptide is recovered or collected either 

from the transfected cells or the medium in which those cells are cultured. Recovery 

comprises isolating and purifying the Alloiococcus otitidis polypeptide. Isolation and 
10 purification techniques for polypeptides are well known in the art and include such 

procedures as precipitation, filtration, chromatography, electrophoresis and the like. 

F. Antibodies Immunoreactive with Alloiococcus otitidis Polypeptides 

15 in still another embodiment, the present invention provides antibodies 

immunoreactive with Alloiococcus otitidis polypeptides. Preferably, the antibodies of 
the invention are monoclonal antibodies. Additionally, the Alloiococcus otitidis 
polypeptides comprise the amino acid residue sequence of one of the even 
numbered sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 106. Means 

20 for preparing and characterizing antibodies are well known in the art (See, e.g., 
Antibodies "A Laboratory Manual", E. Howell and D. Lane, Cold Spring Harbor 
Laboratory, 1988). Polyclonal antisera is obtained by bleeding an immunized animal 
into a glass or plastic container, incubating the blood at 25°C for one hour, followed 
by incubating at 4°C for 2-18 hours. The serum is then recovered by centrifugation. 

25 Briefly, a polyclonal antibody is prepared by immunizing an animal with an 

immunogen comprising a polypeptide or polynucleotide of the present invention, and 
collecting antisera from that immunized animal. A wide range of animal species can 
be used for the production of antisera. Typically an animal used for production of 
anti-antisera is a rabbit, a mouse, a rat, a hamster or a guinea pig. Because of the 

30 relatively large blood volume of rabbits, a rabbit is a preferred choice for production 
of polyclonal antibodies. 

As is well known in the art, a given polypeptide or polynucleotide may vary in 
its immunogenicity. It is often necessary therefore to couple the immunogen (e.g., a 
polypeptide or polynucleotide) of the present invention with a carrier. Exemplary and 
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preferred carriers are keyhole limpet hemocyanin (KLH) and bovine serum albumin 
(BSA). Other albumins such as ovalbumin, mouse serum albumin or rabbit serum 
albumin can also be used as carriers. 

Means for conjugating a polypeptide or a polynucleotide to a carrier protein 
5 are we || known in the art and include glutaraldehyde, m-maleimidobencoyl-N- 
hydroxysuccinimide ester, carbodiimide and bis-biazotized benzidine. 

As is also well known in the art, immunogencity to a particular immunogen 
can be enhanced by the use of non-specific stimulators of the immune response 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's 
10 adjuvant, incomplete Freund's adjuvants, cholera toxin (e.g. mutant cholera toxin 
E29H; see published International Patent Application WO 00/18434), and aluminum 

hydroxide adjuvant. 

The amount of immunogen used for the production of polyclonal antibodies 
depends upon the nature of the immunogen as well as the animal used for 
15 immunization. A variety of routes can be used to administer the immunogen 

(subcutaneous, intramuscular, intradermal, intravenous and intraperitoneal). The 
production of polyclonal antibodies is monitored by sampling blood from the 
immunized animal at various points following immunization. When a desired level of 
immunogenicity is obtained, the immunized animal can be bled and the serum 

20 isolated and stored. 

In another aspect, the present invention contemplates a process of producing 
an antibody immunoreactive with an Alloiococcus otitidis polypeptide comprising the 
steps of (a) transfecting recombinant host cells with a polynucleotide that encodes an 
Alloiococcus otitidis polypeptide; (b) culturing the host cells under conditions 

25 sufficient for expression of the polypeptide; (c) recovering the polypeptides; and (d) 
preparing the antibodies to the polypeptides. Preferably, the host cell is transfected 
with the polynucleotide of one of the odd numbered sequences set forth in SEQ ID 
NO: 1 through SEQ ID NO: 4035. Even more preferably, the present invention 
provides antibodies prepared according to the process described above. 

30 A monoclonal antibody of the present invention can be readily prepared 

through use of well-known techniques such as those exemplified in U.S. Pat. No. 
4,196,265, herein incorporated by reference. Typically, a technique involves first 
immunizing a suitable animal with a selected antigen (e.g., a polypeptide or 
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polynucleotide of the present invention) in a manner sufficient to provide an immune 
response. Rodents such as mice and rats are preferred animals. Spleen cells from 
the immunized animal are then fused with cells of an immortal myeloma cell. Where 
the immunized animal is a mouse, a preferred myeloma cell is a murine NS-1 
5 myeloma cell. 

The fused spleen/myeloma cells are cultured in a selective medium to select 
fused spleen/myeloma cells from the parental cells. Fused cells are separated from 
the mixture of non-fused parental cells, e.g., by the addition of agents that block the 
de novo synthesis of nucleotides in the tissue culture media. Exemplary and 
10 preferred agents are aminopterin, methotrexate, and azaserine. Aminopterin and 
methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, 
the media is supplemented with hypoxanthine and thymidine as a source of 
nucleotides. Where azaserine is used, the media is supplemented with 

15 hypoxanthine. 

This culturing provides a population of hybridomas from which specific 
hybridomas are selected. Typically, selection of hybridomas is performed by 
culturing the cells by single-clone dilution in microtiter plates, followed by testing the 
individual clonal supernatants for reactivity with an antigen-polypeptide. The 

20 selected clones can then be propagated indefinitely to provide the monoclonal 
antibody. 

By way of specific example, to produce an antibody of the present invention, 
mice are injected intraperitoneal^ with between about 1-200 \ig of an antigen 
comprising a polypeptide of the present invention. B lymphocyte cells are stimulated 

25 to grow by injecting the antigen in association with an adjuvant such as complete 

Freund's adjuvant (CFA; a non-specific stimulator of the immune response containing 
killed Mycobacterium tuberculosis). At some time (e.g., at least two weeks) after the 
first injection, mice are boosted by injection with a second dose of the antigen mixed 
with incomplete Freund's adjuvant (I FA; lacks the killed mycobacterium of CFA). 

30 A few weeks after the second injection, mice are tail bled and the sera titered 

by immunoprecipitation against radiolabeled antigen. Preferably, the process of 
boosting and titering is repeated until a suitable titer is achieved. The spleen of the 
mouse with the highest titer is removed and the spleen lymphocytes are obtained by 
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homogenizing the spleen with a syringe. Typically, a spleen from an immunized 
mouse contains approximately 5x1 0 7 to 2x1 0 8 lymphocytes. 

Mutant lymphocyte cells known as myeloma cells are obtained from 
laboratory animals in which such cells have been induced to grow by a variety of 

5 well-known methods. Myeloma cells lack the salvage pathway of nucleotide 
biosynthesis. Because myeloma cells are tumor cells, they can be propagated 
indefinitely in tissue culture, and are thus denominated immortal. Numerous cultured 
cell lines of myeloma cells from mice and rats, such as murine NS-1 myeloma cells, 
have been established. 

10 Myeloma cells are combined under conditions appropriate to foster fusion 

with the normal antibody-producing cells from the spleen of the mouse or rat injected 
with the antigen/polypeptide of the present invention. Fusion conditions include, for 
example, the presence of polyethylene glycol. The resulting fused cells are 
hybridoma cells. Like myeloma cells, .hybridoma cells grow indefinitely in culture. 

15 Hybridoma cells are separated from unfused myeloma cells by culturing in a 

selection medium such as HAT media (hypoxanthine, aminopterin, thymidine). 
Unfused myeloma cells lack the enzymes necessary to synthesize nucleotides from 
the salvage pathway because they are killed in the presence of aminopterin, 
methotrexate, or azaserine. Unfused lymphocytes also do not continue to grow in 

20 tissue culture. Thus, only cells that have successfully fused (hybridoma cells) can 
grow in the selection media. 

Each of the surviving hybridoma cells produces a single antibody. These 
cells are then screened for the production of the specific antibody immunoreactive 
with an antigen/polypeptide of the present invention. Single cell hybridomas are 

25 isolated by limiting dilutions of the hybridomas. The hybridomas are serially diluted 
many times and, after the dilutions are allowed to grow, the supernatant is tested for 
the presence of the monoclonal antibody. The clones producing that antibody are 
then cultured in large amounts to produce an antibody of the present invention in 
convenient quantity. 

30 By use of a monoclonal antibody of the present invention, specific 

polypeptides and polynucleotide of the invention are identified as antigens. Once 
identified, those polypeptides and polynucleotide are isolated and purified by 
techniques such as antibody-affinity chromatography. In antibody-affinity 
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chromatography, a monoclonal antibody is bound to a solid substrate and exposed to 
a solution containing the desired antigen. The antigen is removed from the solution 
through an immunospecific reaction with the bound antibody. The polypeptide or 
polynucleotide is then easily removed from the substrate and purified. 

5 Additionally, examples of methods and reagents particularly amenable for use 

in generating and screening antibody display library can be found in, for example, 
U.S. 5,223,409; WO 92/18619; WO 91/17271; WO 92/20791; WO 92/15679; WO 
93/01288; WO 92/01047; WO 92/09690; WO 90/02809, which are incorporated 
herein in their entirety by reference. 

10 Additionally, recombinant anti->A//o/ococcas otitidis antibodies, such as 

chimeric and humanized monoclonal antibodies, comprising both human and non- 
human fragments, which are made using standard recombinant DNA techniques, are 
within the scope of the invention. Such chimeric and humanized monoclonal 
antibodies are produced by recombinant DNA techniques known in the art, for 

15 example using methods described in PCT/US86/02269; EP 184,1 87; EP 171 ,496; 
EP 173,494; WO 86/01533; U.S. 4,816,567; and EP 125,023. 

An anti-yV/o/ococct/s otitidis antibody (e.g., monoclonal antibody) is used to 
isolate Alioiococcus otitidis polypeptides by standard techniques, such as affinity 
chromatography or immunoprecipitation. An anti-v4//o/ococcus otitidis antibody 

20 facilitates the purification of a natural Alioiococcus otitidis polypeptide from cells and 
recombinantly produced Alioiococcus otitidis polypeptides expressed in host cells. 
Moreover, an an\\-Alloiococcus otitidis antibody is used to detect Alioiococcus otitidis 
polypeptide {e.g., in a cellular lysate or cell supernatant) in order to evaluate the 
abundance of the Alioiococcus otitidis polypeptide. The detection of circulating 

25 fragments of an Alioiococcus otitidis polypeptide is used to identify Alioiococcus 
otitidis polypeptide turnover in a subject. Anti-/4//o/ococcus otitidis antibodies are 
used diagnostically to monitor protein levels in tissue as part of a clinical testing 
procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection is facilitated by coupling (i.e., physically linking) the antibody to a 

30 detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 
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acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylarnine fluorescein, dansyl chloride or phycoerythrin; an example of a 
5 luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and acquorin, and examples of suitable radioactive material 
include 125 l, 131 l, 15 Sor 3 H. 

G. Pharmaceutical Compositions 

l ° In certain embodiments, the present invention provides pharmaceutical 

compositions comprising compounds that inhibit the activities of Alloiococcus otitidis 
polypeptides, and physiologically acceptable carriers. Compounds that inhibit the 
activities of Alloiococcus otitidis polypeptides polypeptides, which are essential for 

15 the proliferation of the bacteria, are identified using one or more assay systems set 
forth in Examples 5-38. More preferably, the pharmaceutical compositions comprise 
one or more compounds that inhibit the activities of Alloiococcus otitidis polypeptides 
comprising the amino acid residue sequence of one or more of the even numbered 
sequences set forth in SEQ ID NO: 2 through SEQ ID NO: 1 06. In other 

20 embodiments, the pharmaceutical compositions of the invention comprise antisense 
polynucleotides of polynucleotides selected from one of the odd numbered 
sequences set forth in Seq. ID NO. 1 to Seq. ID No. 105, and physiologically 

acceptable carriers. 

Various tests are to be used to assess the in vitro and in vivo efficacy of 

25 anitmicrbbiai and pharmaceutical compounds that inhibit the activities of Alloiococcus 
otitidis polypeptides, and these are set forth in detail in Examples 5 through 38. For 
example, an in vitro activity of the compounds may be assayed by incubating 
together a mixture of Alloiococcus otitidis or other heterologous bacterial cells such 
as E. coii cells expressing Alloiococcus otitidis polypeptides set forth in one of the 

30 even numbered sequences from Seq. ID No. 2 to Seq. ID No. 106, and then 

measuring the activity of the polypeptide using one or more of the assay systems 
detailed in Example 5 through 38. 

The Alloiococcus otitidis polynucleotides, polypeptides, compounds that 
modulate the activity of an Alloiococcus otitidis polypeptides, and an\\-A!loiococcus 
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otitidis antibodies (also referred to herein as "active compounds 0 ) of the invention can 
be incorporated into pharmaceutical compositions suitable for administration to a 
host or subject, e.g., a human. Such compositions typically comprise the nucleic acid 
molecule, protein, antimicrobial compound, or antibody and a pharmaceutical^ 
5 acceptable carrier. As used herein the language "pharmaceutical^ acceptable 
carrier" is intended to include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and absorption delaying agents, and the 
like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutical^ active substances is well known in the art. Except 
10 insofar as any conventional media or agent is incompatible with the active 
compound, such media can be used in the compositions of the invention. 
Supplementary active compounds can also be incorporated into the compositions. 

A pharmaceutical of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include 
15 parenteral, (e.g., intravenous, intradermal, subcutaneous, intraperitoneal), 

transmucosal (e.g., oral, rectal, intranasal, vaginal, respiratory), and transdermal 
(topical). Solutions or suspensions used for parenteral, intradermal, or subcutaneous 
application can include the following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol 
20 or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents 
such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 
phosphates and agents for the adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
25 sodium hydroxide. The parenteral preparation can be enclosed in ampoules, 
disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions (where water-soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
30 intravenous administration, suitable carriers include physiological saline, 

bacteriostatic water, Cremophor EL™(BASF, Parsippany, NJ) or phosphate buffered 
saline (PBS). In all cases, the composition must be sterile and should be fluid to the 
extent that easy syringability exists. It must be stable under the conditions of 
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can be included as part of the composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or compounds of a similar 
nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
5 Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a 
giidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or 
saccharin; or a flavoring agent such as peppermint, methyl salicylate, or orange 
flavoring. 

For administration by inhalation, the compounds are delivered in the form of 
10 an aerosol spray from pressured container or dispenser that contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. Systemic 
administration can also be by transmucosal or transdermal means. For transmucosal 
or transdermal administration, penetrants appropriate to the barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art, and 
15 include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. 

20 The compounds can also be prepared in the form of suppositories (e.g., with 

conventional suppository bases such as cocoa butter and other glycerides) or 
retention enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
25 release formulation, including implants and microencapsulated delivery systems. 

Biodegradable, biocompatible polymers can be used, such as ethylene vinyl 
acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic 
acid. Methods 

30 H. Diagnostic Assays 

The invention also provides methods for detecting the presence of an 
Alloiococcus otitidis polypeptide or Alloiococcus otitidis polynucleotide, or fragment 
thereof, in a biological sample. The method involves contacting the biological sample 
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with a compound or an agent capable of detecting an Alloiococcus otitidis 
polypeptide or mRNA such that the presence of the Alloiococcus otitidis 
polypeptide/encoding nucleic acid molecule is detected in the biological sample. A 
preferred agent for detecting Alloiococcus otitidis mRNA or DNA is a labeled or 

5 labelable oligonucleotide probe capable of hybridizing to Alloiococcus otitidis mRNA 
or DNA. The nucleic acid probe can be, for example, a full-length Alloiococcus 
otitidis polynucleotide of one of the odd numbered sequences set forth in SEQ ID 
NO: 1 through SEQ ID NO: 105, a complement thereof, or a fragment thereof, such 
as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

10 sufficient to specifically hybridize under stringent conditions to Alloiococcus otitidis 
mRNA or DNA. Alternatively, the sample can be contacted with an oligonucleotide 
primer of an Alloiococcus otitidis polynucleotide of SEQ ID NO: 1 through SEQ ID 
:105, a complement thereof, or a fragment thereof, in the presence of nucleotides 
and a polymerase, under conditions permitting primer extension. 

15 A preferred agent for detecting Alloiococcus otitidis polypeptide is a labeled or 

labelable antibody capable of binding to an Alloiococcus otitidis polypeptide. 
Antibodies can be polyclonal, or more preferably, monoclonal. An intact antibody, or 
a fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled or 
labelable," with regard to the probe or antibody, is intended to encompass direct 

20 labeling of the probe or antibody by coupling (/.a, physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 

25 detected with fluorescently labeled streptavidin. The term "biological sample" is 

intended to include tissues, cells and biological fluids isolated from a subject, as well 
as tissues, cells and fluids present within a subject. That is, the detection method of 
the invention can be used to detect Alloiococcus otitidis mRNA, DNA or protein in a 
biological sample in vitro as well as in vivo. For example, in vitro techniques for 

30 detection of Alloiococcus otitidis mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of Alloiococcus otitidis polypeptide 
include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations and immunofluorescence. Alternatively, Alloiococcus otitidis 



-61- 



WO 03/104391 



PCT/US02/36122 



polypeptides can be detected in vivo in a subject by introducing into the subject a 
labeled an\\-Alloiococcus otitidis antibody. For example, the antibody can be labeled 
with a radioactive marker whose presence and location in a subject can be detected 
by standard imaging techniques. 
5 The polynucleotides according to the invention may also be used in analytical 

DNA chips, which allow sequencing, the study of mutations and of the expression of 
genes, and which are currently of interest given their very small size and their high 
capacity in terms of number of analyses. 

The principle of the operation of these chips is based on molecular probes, 
10 most often oligonucleotides, which are attached onto a miniaturized surface, 

generally of the order of a few square centimeters. During an analysis, a sample 
containing fragments of a target nucleic acid to be analyzed, for example DNA or 
RNA labeled, for example, after amplification, is deposited onto the DNA chip in 
which the support has been coated beforehand with probes. Bringing the labeled 
15 target sequences into contact with the probes leads to the formation, through 

hybridization, of a duplex according to the rule of pairing defined by J.D. Watson and 
F. Crick. After a washing step, analysis of the surface of the chip allows the effective 
hybridizations to be located by means of the signals emitted by the labels tagging the 
target. A hybridization fingerprint results from this analysis which, by appropriate 
20 computer processing, will make it possible to determine information such as the 
presence of specific fragments in the sample, the determination of sequences and 
the presence of mutations. 

The chip consists of a multitude of molecular probes, precisely organized or 
arrayed on a solid support whose surface is miniaturized. It is at the center of a 
25 system where other elements (imaging system, microcomputer) allow the acquisition 
and interpretation of a hybridization fingerprint. 

The hybridization supports are provided in the form of flat or porous surfaces 
(pierced with wells) composed of various materials. The choice of a support is 
determined by its physicochemical properties, or more precisely, by the relationship 
30 between the latter and the conditions under which the support will be placed during 
the synthesis or the attachment of the probes or during the use of the chip. It is 
therefore necessary, before considering the use of a particular support, to consider 
characteristics such as its stability to pH, its physical strength, its reactivity and its 
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chemical stability as well as its capacity to nonspecifically bind nucleic acids. 
Materials such as glass, silicon and polymers are commonly used. Their surface is, 
in a first step, called "functionalization", made reactive towards the groups which it is 
desired to attach thereon. After the functionalization, so-called spacer molecules are 

5 grafted onto the activated surface. Used as intermediates between the surface and 
the probe, these molecules of variable size render unimportant the surface properties 
of the supports, which often prove to be problematic for the synthesis or the 
attachment of the probes and for the hybridization. 

Among the hybridization supports, there may be mentioned glass which is 

10 used, for example, in the method of in situ synthesis of oligonucleotides by 

photochemical addressing developed by the company Affymetrix (E.L. Sheldon, 
1993), the glass surface being activated by silane. Genosensor Consortium 
(P. Merel, 1 994) also uses glass slides carrying wells 3 mm apart, this support being 
activated with epoxysilane. 

15 The probes according to the invention may be synthesized directly in situ on 

the supports of the DNA chips. This in situ synthesis may be carried out by 
photochemical addressing (developed by the company Affymax (Amsterdam, 
Holland) and exploited industrially by its subsidiary Affymetrix (United States)) or 
based on the VLSI PS (very large scale immobilized polymer synthesis) technology 

20 (S.P.A. Fodor et a/., 1 991 ) which is based on a method of photochemically directed 
combinatory synthesis and the principle of which combines solid-phase chemistry, 
the use of photolabile protecting groups and photolithography. 

The probes according to the invention may be attached to the DNA chips in 
various ways such as electrochemical addressing, automated addressing or the use 

25 of probe printers (T. Livache et a/., 1994; G. Yershov et aL, 1996; J. Derisi et a/., 
1996, and S. Borman, 1996). 

The revealing of the hybridization between the probes of the invention, 
deposited or synthesized in situ on the supports of the DNA chips, and the sample to 
be analyzed, may be determined, for example, by measurement of fluorescent 

30 signals, by radioactive counting or by electronic detection. 

The use of fluorescent molecules such as fluorescein constitutes the most 
common method of labeling the samples. It allows direct or indirect revealing of the 
hybridization and allows the use of various fluorochromes. 
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Affymetrix currently provides an apparatus or a scanner designed to read its 
Gene Chip™ chips. It makes it possible to detect the hybridizations by scanning the 
surface of the chip in confoca! microscopy (R.J. Lipshutz et a/., 1995). 

The nucleotide sequences according to the invention are also used in DNA 
5 chips to carry out the analysis of the expression of the Alloiococcus otitidis genes. 
This analysis of the expression of Alloiococcus otitidis genes is based on the use of 
chips where probes of the invention, chosen for their specificity to characterize a 
given gene, are present (DJ. Lockhart et at., 1996; D.D. Shoemaker ef a/., 1996). 
For the methods of analysis of gene expression using the DNA chips, reference may, 
10 for example, be made to the methods described by D.J. Lockhart et al. (1 996) and 
Sosnowsky et al (1 997) for the synthesis of probes in situ or for the addressing and 
the attachment of previously synthesized probes. The target sequences to be 
analyzed are labeled and in general fragmented into sequences of about 50 to 
100 nucleotides before being hybridized onto the chip. After washing as described, 
15 for example, by D.J. Lockhart et al. (1 996) and application of different electric fields 
(Sosnowsky ef a/., 1997), the labeled compounds are detected and quantified, the 
hybridizations being carried out at least in duplicate. Comparative analyses of the 
signal intensities obtained with respect to the same probe for different samples 
and/or for different probes with the same sample, determine the differential 
20 expression of RNA or of DNA derived from the sample. 

The nucleotide sequences according to the invention are, in addition, used in 
DNA chips where other nucleotide probes specific for other microorganisms are also 
present, and allow the carrying out of a serial test allowing rapid identification of the 
presence of a microorganism in a sample: 
25 Accordingly, the subject of the invention is also the nucleotide sequences 

according to the invention, characterized in that they are immobilized on a support of 
a DNA chip. 

The DNA chips, characterized in that they contain at least one nucleotide 
sequence according to the invention, immobilized on the support of the said chip, 
30 also form part of the invention. 

The chips preferably contain several probes or nucleotide sequences of the 
invention of different length and/or corresponding to different genes so as to identify, 
with greater certainty, the specificity of the target sequences or the desired mutation 
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in the sample to be analyzed. 

Accordingly, the analyses carried out by means of primers and/or probes 
according to the invention, immobilized on supports such as DNA chips, make it 
possible, for example, to identify, in samples, mutations linked to variations such as 
5 intraspecies variations. These variations may be correlated or associated with 
pathologies specific to the variant identified and make it possible to select the 

appropriate treatment. 

The invention thus comprises a DNA chip according to the invention, 
characterized in that it contains, in addition, at least one nucleotide sequence of a 
10 microorganism different from Alloiococcus otitidis, immobilized on the support of the 
said chip; preferably, the different microorganism is chosen from an associated 
microorganism, a bacterium of the Streptococcus family, and a variant of the species 

Alloiococcus otitidis. 

The principle of the DNA chip as explained above, is also used to produce 

15 protein "chips" on which the support has been coated with a polypeptide or an 

antibody according to the invention, or arrays thereof, in place of the DNA. These 
protein "chips" make it possible, for example, to analyze the biomolecular interactions 
(BIA) induced by the affinity capture of target analytes onto a support coated, for 
example, with proteins, by surface plasma resonance (SPR). Reference may be 

20 made, for example, to the techniques for coupling proteins onto a solid support which 
are described in EP 524 800 or to the methods describing the use of biosensor-type 
protein chips such as the Bl Acore-type technique (Pharmacia) (Arlinghaus et aL, 
1997, Krone et al., 1997, Chatelier etal., 1995). These polypeptides or antibodies 
according to the invention, capable of specifically binding antibodies or polypeptides 

25 derived from the sample to be analyzed, are thus used in protein chips for the 

detection and/or the identification of proteins in samples. The said protein chips may 
in particular be used for infectious diagnosis and preferably contain, per chip, several 
polypeptides and/or antibodies of the invention of different specificity, and/or 
polypeptides and/or antibodies capable of recognizing microorganisms different from 

30 Alloiococcus otitidis. 

Accordingly, the subject of the present invention is also the polypeptides and 
the antibodies according to the invention, characterized in that they are immobilized 
on a support, in particular, on a protein chip. 
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The protein chips, characterized in that they contain at least one polypeptide 
or one antibody according to the invention immobilized on the support of the said 
chip, also form part of the invention. 

The invention comprises, in addition, a protein chip according to the invention, 
5 characterized in that it contains, in addition, at least one polypeptide of a 

microorganism different from Alloiococcus otitidis or at least one antibody directed 
against a compound of a microorganism different from Alloiococcus otitidis, 
immobilized on the support of the chip. 

The invention also relates to a kit or set for the detection and/or the 
10 identification of bacteria belonging to the species Alloiococcus otitidis or to an 
associated microorganism, or for the detection and/or the identification of a 
microorganism characterized in that it comprises a protein chip according to the 
invention. 

The present invention also provides a method for the detection and/or the 
15 identification of bacteria belonging to the species Alloiococcus otitidis or to an 
associated microorganism in a biological sample, characterized in that it uses a 
nucleotide sequence according to the invention. 

The invention also encompasses kits for detecting the presence of an 
Alloiococcus otitidis polypeptide in a biological sample. For example, the kit 
20 comprises reagents such as a labeled or labelable compound or agent capable of 

detecting Alloiococcus otitidis polypeptide or mRNA in a biological sample; means for 
determining the amount of Alloiococcus otitidis polypeptide in the sample; and means 
for comparing the amount of Alloiococcus otitidis polypeptide in the sample with a 
standard. The compound or agent are packaged in a suitable container.. The kit 
25 further comprises instructions for using the kit to detect Alloiococcus otitidis mRNA or 
protein. 

In certain embodiments, detection involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g. U.S. 4,683,195 and U.S. 4,683,202), 
such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction 
30 (LCR). This method includes the steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more primers which specifically 
hybridize to an Alloiococcus otitidis polynucleotide under conditions such that 
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hybridization and amplification of the Alloiococcus otif/d/s-polynucleotide (if present) 
occurs, and detecting the presence or absence of an amplification product, or 
detecting the size of the amplification product and comparing the length to a control 
sample. 

5 

I. Transgenic Animals 

It is contemplated that in some instances the genome of a transgenic animal 
of the present invention will have been altered through the stable introduction of one 

10 or more of the Alloiococcus otitldis polynucleotide compositions described herein, 
either native, synthetically modified or mutated. As described herein, a "transgenic 
animal" refers to any animal, preferably a non-human mammal (e.g. mouse, rat, 
rabbit, squirrel, hamster, rabbits, guinea pigs, pigs, micro-pigs, baboons, squirrel 
monkeys and chimpanzees, etc), bird or an amphibian, in which one or more cells 

15 contain a heterologous nucleic acid sequence introduced by way of human 

intervention, such as by transgenic techniques well known in the art. The nucleic acid 
is introduced into the cell, directly or indirectly, by introduction into a precursor of the 
cell, by way of deliberate genetic manipulation, such as by microinjection or by 
infection with a recombinant virus. The term genetic manipulation does not include 

20 classical crossbreeding, or in vitro fertilization, but rather is directed to the 

introduction of a recombinant DNA molecule. This molecule may be integrated within 
a chromosome, or it may be extrachromosomally replicating DNA. 

The host ceils of the invention are also used to produce non-human 
transgenic animals. The non-human transgenic animals are used in screening 

25 assays designed to identify infections or compounds, e.g., drugs, pharmaceuticals, 
efc, which are capable of ameliorating Alloiococcus otitidis symptoms or infections. 
For example, in one embodiment, a host cell of the invention is a fertilized oocyte or 
an embryonic stem cell into which an Alloiococcus otitidis polypeptide-coding 
sequence has been introduced. Such host cells are then used to create non-human 

30 transgenic animals in which exogenous Alloiococcus otitidis gene sequences have 
been introduced into their genome or homologous recombinant animals in which 
endogenous Alloiococcus otitidis gene sequences have been altered. Such animals 
are useful for studying the effects of an Alloiococcus otitidis polypeptide and for 
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identifying and/or evaluating modulators of Alloiococcus otitidis polypeptide 
infectivity. 

A transgenic animal of the invention is created by introducing an Alloiococcus 
otitidis polypeptide-encoding nucleic acid sequence into the male pronuclei of a 

5 fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to 
develop in a pseudopregnant female foster animal. The human Alloiococcus otitidis 
cDNA sequence of one or more of SEQ ID NO:1 through SEQ ID NO: 4035 can be 
introduced as a transgene into the genome of a non-human animal. 

Moreover, a non-/4//o/ococci/s otitidis homologue of the Alloiococcus otitidis 

10 gene can be isolated based on hybridization to the Alloiococcus otitidis cDNA 

(described above) and used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression 
of the transgene. A tissue-specific regulatory sequence(s) can be operably linked to 
the Alloiococcus otitidis transgene to direct expression of an Alloiococcus otitidis 

15 polypeptide to particular cells. Methods for generating transgenic animals via embryo 
manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. 4,736,866 and 4,870, 
009, U.S. 4,873,191 and in Hogan, 1986. Similar methods are used for production of 
other transgenic animals. A transgenic founder animal can be identified based upon 

20 the presence of the Alloiococcus otitidis transgene in its genome and/or expression 
of Alloiococcus otitidis mRNA in tissues or cells of the animals. A transgenic founder 
animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a transgene encoding an Alloiococcus otitidis 
polypeptide can further be bred to other transgenic animals carrying other 

25 transgenes. 

In another embodiment, transgenic non-human animals can be produced 
which contain selected systems that allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase system of bacteriophage 
PA. For a description of the cre/loxP recombinase system, see, e.g., Lakso et a/., 
30 1 992. Another example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gon-nan et al., 1991). If a cre/loxP recombinase 
system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. 
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Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene 
encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

5 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et aL, 1997, and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a celt, e.g., 
a somatic cell, from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G Q phase. The quiescent cell can then be fused, e.g., through 

10 the use of electrical pulses, to an enucleated oocyte from an animal of the same 
species from which the quiescent cell is isolated. The reconstructed oocyte is then 
cultured such that it develops to morula or blastocyst and then transferred to 
pseudopregnant female foster animal. The offspring borne of this female foster 
animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 

15 isolated. 

All patents and publications cited herein are hereby incorporated by 
reference. 

The following examples are carried out using standard techniques, which are 
well known and routine to those of skill in the art, except where otherwise described 
20 in detail. The following examples are presented for illustrative purposes, and should 
not be construed in any way limiting the scope of this invention. 

Example 1 

Confirmation of the identity of the Alloiococc us otitidis 1 1 04-92 isolate 

25 

The Alloiococcus otitidis isolate 1 104-92 was obtained from Dr. Richard 
Facklam of the Centers for Disease Control in Atlanta. It was isolated from the middle 
ear fluid of a child in the Atlanta, Georgia area. It was confirmed to be A otitidis by 
comparing it to the type strain, ATCC51267, obtained from the American Type 
30 Culture Collection [Aguirre, 1 992 #1]. Both the 1 1 04-92 and type strain are 

characterized as Gram positive cocci. Both grow on Columbia agar supplemented 
with 5% yeast extract, 0.5% polysorbate 80 (Tween 80), and 0.7% phospatidyl 
choline when incubated at 37°C. On this medium, both strains form slow growing 
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small white colonies that require nearly two days to be easily observed with the 
naked eye. Both are sensitive to lysis by hen egg white lysozyme and Streptococcus 
globisporus mutanoiysin. Both grow in the presence of 2% sodium azide. Both are 
killed by incubation at 55°C for 30 minutes. Finally, to further confirm that the 1 1 04- 
5 92 was a strain of A. otitidis, it was subject to polymerase chain reaction (PCR) 
identification based on its 1 6s rRNA gene. This was done using two of the primers 
specified by Aguirre and Collins [Aguirre, 1992 #2]. The antisense primer used was 
5'- ATCTTCCTG CTTG CAG G AAG AG G -3' and the sense primer was 
3'-CGCTTCATCTCTGAAGCTAGC-5\ Thus by multiple criteria, the 1 1 04-92 
10 strain was confirmed to be an isolate of A. otitidis. 



EXAMPLE 2 

Storage, growth, and harvest of Alloiococcus otitidis 1 1 04-92 for isolation 

OF DNA 

15 

The A. otitidis isolate 1 1 04-92 was stored at -70°C in Todd-Hewlett broth 
containing 40% glycerol. A small portion of the frozen stock was streaked onto the 
agar medium described in Example 1 and incubated at 37°C for two days. The 
growth from the plate was swabbed into a 17 x 100 cm tube containing 6 ml of a 

20 serum-free broth medium. This broth medium was prepared with 30 g Todd-Hewlett 
medium, 5 g yeast extract, 10 ml polysorbate 80 (Tween 80), and 1 liter distilled 
water. This medium was sterilized by autoclaving for 35 minutes. The bacteria were 
incubated aerobically without shaking in an aerobic incubator at 37°C for two days. 
The tube containing the growing bacteria was then shaken to resuspend the bacteria 

25 and added to a liter of the same medium in a Fembach flask. This flask, in turn, was 
incubated aerobically for three days without shaking. The bacteria were harvested 
by first swirling the flask to suspend the bacteria and then low speed centrifugation at 
about 5,000 x g for 30 minutes. The pellet of bacteria was washed by resuspending 
it in 10 to 15 mL of phosphate buffered saline (PBS), and centrifuging the suspension 

30 at about 8,000 x g for 20 minutes. The pellet of bacteria was retained and stored 
frozen at -20°C. The yield of wet bacterial pellet was typically about 1 g per liter of 
broth. 
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EXAMPLE 3 

PREPARATION OF ALLOIOCOCCUS OTITID IS GENOMIC DN A 

To prepare genomic DNA, 0.95 g frozen peliet of bacteria was defrosted and 
5 suspended in 10 mL of PBS containing 1 mM MgCI 2 . The bacteria were killed by 
incubating the suspension at 55°C for 20 minutes. The suspension was allowed to 
cool before adding 25 |xl of a 10 mg/mL stock of hen egg white lysozyme and 50 \i\ of 
a 25,000 unit/mL stock of Streptococcus globisporus mutanolysin to the suspension. 
It was then incubated for one hour at 37°C. Then 50 \i\ of a 1 0 mg/mL stock of 
10 RNase was added and the suspension incubated an additional hour at 37°C. After 
these incubations, sodium dodecylsulfate (SDS) was added to a final concentration 
of 0.3% (0.3 mL of a 10% stock). This was followed by the addition of 0.3 mL of a 1 
mg/mL stock of proteinase K. The suspension was then incubated for two hours at 
37°C. After this time, an equal volume of water saturated phenol/chloroform/isopropyl 
15 (25:24:1) was added to the digested suspension and gently mixed. The upper 
aqueous layer was retained after a low speed centrifugation and 2.5 volumes of 
ethanol were added and the tube gently inverted to mix. The DNA was then spooled 
out on a glass rod and allowed to air dry. 

The DNA at this stage still contained obvious impurities and needed further 
20 purification. The DNA dried on the glass rod was soaked in 70% ethanol to remove 
excess phenol and air-dried once again. It was then suspended in 2 ml of Tris-EDTA 
buffer to which 2 [i\ of RNase cocktail was added and incubated at room temperature 
for 75 minutes. Then 100 *il of protease, 100 \i\ SDS and 40 \i\ of 100 mM CaCI 2 
were added and the suspension incubated for 3.5 hours. An equal volume of 
25 chloroform was added, gently mixed, then centrifuged at a low speed. The aqueous 
layer was collected and re-extracted with the phenol, chloroform, isopropyl alcohol 
reagent In turn, the aqueous layer was extracted with chloroform. At this point, 3 M 
sodium acetate was added to the aqueous phase collected form the last extraction 
and then 3.75 ml of ethanol was added and gently mixed. The DNA was spooled out, 
30 soaked in 70% ethanol and allowed to air-dry. The DNA was finally suspended in 2 
mi of Tris-EDTA buffer. Based on absorption at 260 nm, the final yield of DNA was 
482 ^tg of DNA. The DNA was confirmed to be that of A otitidis by the PCR method 
described in example 1. This DNA was submitted for sequencing. 
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Example 4 

Cloning And Sequencing Alloiococcus ormpis Genome 

5 This invention provides nucleotide sequences of the genome of Alloiococcus otitidis 
which thus comprises a DNA sequence library of Alloiococcus otitidis genomic DNA. 
The detailed description that follows provides nucleotide sequences of Alloiococcus 
otitidis, and also describes how the sequences were obtained and how ORFs (Open 
Reading Frames) and protein-coding sequences can be identified. 

10 To construct a library, genomic DNA was hydrodynamically sheared in an 

HPLC and then separated on a standard 1% agarose gel. A fraction corresponding 
to 3000-3500 bp in length was excised from the gel and purified by the GeneClean 
procedure (BIO101, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA 

15 polymerase. The blunt-ended DNA was then ligated to unique BstX1 -linker adapters. 
These linkers are complimentary to the pGTC vector, while the overhang is not self- 
complimentary. Therefore, the linkers will not concatermerize nor will the cut-vector 
religate itself easily. The liner-adapted inserts were separated from the 
unincorporated linkers on a 1% agarose gel and again purified using GeneClean. 

20 The linker-adapted inserts were then ligated to BstX1-cut vector to construct 
"shotgun 0 subclone libraries. 

Only major modifications to the protocols are highlighted. Briefly, the library 
was transformed into DH10B competent cells (Gibco/BRL, DH5a transformation 
protocol). Transformed cells were detected by plating onto antibiotic plates 

25 containing ampicillin. The plates were incubated overnight at 37° C. Transformant 
clones were then selected for sequencing. The cultures were grown overnight at 
37°C. DNA was purified using a silica bead DNA preparation (Egelstein, 1 996) 
method. In this manner, 25 mg of DNA was obtained per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 

30 chemistry. All subsequent steps were based on sequencing by automated DNA 
sequencing methods. The ABI dye terminator sequence reads were run on 
MegaBace™ 10000 (Amersham) machines and the data transferred to UNIX based 
computers. Base calls and quality scores were determined using the PHRED 
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software program (Ewing et al., 1998, Genome Res. 8: 175-185; Ewing and Green, 
1998, Genome Res. 8:685-734). Reads were assembled using PHRAP (P. Green, 
Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, Jan. 
1996, p 157) with default program parameters and quality scores. 

5 To identify Alloiococcus otitidis genome encoded polypeptides, the complete 

genomic sequence of Alloiococcus otitidis was analyzed essentially as follows: First, 
all possible stop-to-stop open reading frames (ORFs) > 222 nucleotides in all three 
reading frames were translated into amino acid sequences. 

Second, the identified ORFs were analyzed for homology to known protein 

10 sequences. Third, the coding potential of non-homologous sequences were 

evaluated with the GENEMARKTM software program (Borodovsky and Mclninch, 
1993, Comp. Chem. 17:123). The results of these analysis are set forth in tables 2- 
16. 

15 Example 5 

Identification of specific genes in Alloiococcus otitidis 

Alloiococcus otitidis homologs of the genes listed in Table 4 were identified as 

follows: 

20 Protein sequences of interest ("query sequences", Table 4) were extracted 

from Genbank from one or more species; query species included but were not limited 
to Staphylococcus aureus, Streptococcus pnuemoniae, Streptococcus pyogenes, 
Lactococcus lactis t Escherichia coli, and Bacillus subtilis. These queries were 
compared to the Alloiococcus otitidis sequence by several methods in order to 

25 determine which Alloiococcus sequence was the ortholog for the query gene. 

First, the query sequences were compared to the translated Alloiococcus 
otitidis ORF set using BLASTP. The ORF set was generated as described in 
Vaccines patent, except that for each ORF that had multiple potential start codons, 
only the longest ORF was used. The top 1 0 Alloiococcus otitidis hits for each query 

30 were saved, without regard to score. 

These Alloiococcus otitidis hits were then compared to NR, the nonredundant 
Genpept database, using BLASTP. An Alloiococcus otitidis ORF was considered the 
ortholog of a query sequence if the genes were reciprocal best hits in Alloiococcus 
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otitidis and the query genome. This analysis is also sumarized in Table 4 (excel file 
AOT_PATENT_FILE.xls, Sheet TopHitsAndClustalKey). Specific numerical cutoffs 
were not used; however all top hits had Expect values of less than 3x1 CT 28 . 

Several query sequences had more than one high-scoring hit in Alloiococcus 
5 otitidis. In most cases, however, only the first, best hit to the original query sequence 
had that query sequence as its reciprocal best hit For example, the Streptococcus 
pyogenes query sequence GyrA (alpha subunit of DNA gyrase) has two high-scoring 
hits in Alloiococcus otitidis. These were distinguished by the reciprocal blast 
analysis; the first, ORF_505 (60% identity, Expect = 0) is the GyrA homoiog and the 
10 second, ORF_1 907 (38% identity, Expect = 1 x 1 0 ~ 154 ) is the homoiog of the query 
sequence GrIA or ParC (topoisomerase IV, A subunit). Other examples of closely 
related proteins include the B subunits of DNA gyrase (GyrB) and Topoisomerase IV 
(GrIB or ParE); and YphC and Era, both of which are putative GTP binding proteins 
of unknown function. These Alloiococcus otitidis ORFS were assigned based on 
15 their top hit in Genpept. 

In two cases the multiple high-scoring hits in Alloiococcus otitidis were the 
result of gene duplication. In the case of MurA (UDP-N-acetylglucosamine 
enolpyruvyl transferase) two separate Alloiococcus otitidis ORFS were determined to 
be the desired orthologs, because both had MurA (or MurZ, alternate notation) as 
20 their best hit in Genpept. Likewise, there are two FoIC (folylpolyglutamate synthase) 
homologs in Alloiococcus otitidis. It is known that other bacteria, particularly Gram- 
positive bacteria, may carry two homologs of each of these genes. 

As a further step in verification of gene assignments, the Alloiococcus otitidis 
ORFS identified as orthologs of the query genes by the analysis above were then 
25 compared to an internal copy of the COGS database (Tatusov RL, Natale DA, 

Garkavtsev IV, TatusovaTA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, 
Fedorova ND, Koonin EV, 2001, Nucleic Acids Res 2001 Jan 1;29(1):22-8. The 
COG database: new developments in phylogenetic classification of .proteins from 
complete genomes) using BLASTP. The COGS database is a curated set of proteins 
30 from a set of finished bacterial genomes, which have been grouped into specific 
protein families on the basis of protein similarity. In all cases, the Alloiococcus 
otitidis ORF was most closely related to the COGS family of the initial query protein, 
if that protein had been assigned to a COGS family. Examples of proteins for which 
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there is no COGS family defined (in our local version of the database) include SrtA 
(sortase) and MvaK1 (phosphomevalonate kinase). 

As a final confirmation, all query proteins were compared to the complete 
Alloiococcus otitidis nucleotide sequence using TBLASTN, in order to determine if 
5 there were additional and/or better hits that had not been predicted as ORFS. In all 
cases, the same sequence was identified as the best hit by TBLASTN and by 
BLASTP. 

For one query sequence, sortase, the Alloiococcus otitidis ORF that was the 
top hit (Expect = 0.42) by the initial BLASTP or TBLASTN using the Staphylococcus 

10 aureus sortase sequence as query was found by additional analysis (reciprocal blast) 
to be a putative ABC-transport protein. The true sortase homolog in Alloiococcus 
otitidis was identified by construction of a Hidden Markov Model based on a multiple 
alignment of 72 known and putative sortase proteins that had been identified 
previously using similar computational methods. The model was constructed using 

15 "hmmbuild" and the Alloiococcus otitidis ORF set was searched using "hmmsearch", 
both of the hmmer package (S.R. Eddy. Profile hidden Markov models. 
Bioinformatics 14:755-763, 1998). The assignment of ORF_876 as sortase was then 
confirmed by reciprocal blast as described above and in Table 2. ORF_876 was also 
found to be the top hit in Alloiococcus otitidis when the Bacillus subtilis putative 

20 sortase (YhcS) was used as the query sequence in a BLASTP search. The Bacillus 
halodurans BH3596 Bacillus subtilis YhcS and proteins that are the top hits for 
RF_876 have recently been placed into a COGS group of sortases, further 
confirming the identity of ORF_876 as the Alloiococcus otitidis sortase. 
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DNA 


orf" j 


Protein 


orf n 
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NO. 


SEQ ID NO.r 


Start j 
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SEQ ID No. I 


Gene 
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Seq. ID No. 1 
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Seq. ID No. 2 
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murF 


57b 
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rpoE 
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88726 
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111542 
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rpoC 
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Seq. ID No. 17 
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256246 


254297 


Seq. ID No. 20 


gyrB 
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Seo. ID No. 21 


259131 


259116 


257914 


Seq. ID No. 22 


dnaN 


528b 


Seq. ID No. 23 


263837 
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Seq. ID No. 24 
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Seq. ID No. 25 
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Seq. ID No. 26 
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Seq. ID No. 27 
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Seq. ID No. 28 
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Seq. ID No. 29 
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folC-1 
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Seq. ID No. 32 
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Seq. ID No. 36 
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Seq. ID No. 40 
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1273 
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684289 


Seq. ID No. 42 
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Seq. ID No. 43 
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Seq. ID No. 44 
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Seq. ID No. 45 
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Seq. ID No. 46 
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Seq. ID No. 47 
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Seq. ID No. 48 


coaA 
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Seq. ID No. 49 
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Seq. ID No. 50 
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Seq. ID No. 51 
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815920 


Seq. ID No. 52 
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Seq. ID No. 54 
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Seq. ID No. 56 


folA 
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Seq. ID No. 57 


1 040639 


1 040645 


1 042606 


Seq. ID No. 58 


GrIB 
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Seq. ID No. 59 


1 042729 


1 042732 


1045191 


Seq. ID No. 60 


grIA 
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(Cont'd.) 



1 OPP 


DMA "T 


ORF 


Protein 


ORF | 


Protein I 


1 


1 NO 


SEQ ID NO.r 


Start t 


Start 


Stop i 


SEQ ID No. I 


Gene I 


IQQfir 


Can ID MO 61 


1098801 


1 098798 


1 097689 


Seq. ID No. 62 


rpoD 


1QQ9h 


Con ID Nfl fiS 


1 100670 


1 1 00670 


1098817 


Seq. ID No. 64 


dnaG 


cUUO 


c pn in m 0 fig 


1109198 


1109144 


1108212 


Seq. ID No. 66 


era 


on! fih 

i on 


c pn ir\ No 67 


111 54*35 


111 5390 

ill »ju^v 


1 1 1 3879 


Seq. ID No. 68 


norA 


d IOO 


Can in Mn RQ 


1 17QQQ5 


1 179938 

11/ JU 


1 1 75604 


Sea. ID No. 70 


polC 


2 lo t D 


Can 1 r\ M r\ *71 

oeq. IU NO. f\ 


l cUODUO 


I £-UOOOO 


1?u?2R1 


Sea ID No 72 


oba 


2204 


oeq. IU NO. / o 


I ill DOilO 




1 > 1 


Sea ID No 74 


vnhC 


2240C 


oeq. IU NO. /O 




I (iOOOU / 




Sea ID No 76 


dnaE 


2284 


bGq. IU NO. / / 


H OR1 OPQ 


1 9fi1 nfi<i 


I ^JJOJO 


Sea ID No 78 


coaBC 




Cpa in No 79 


1286689 

1 i-m \J V^/ \J \*J \*S 


1 286668 


1285637 


Seq. ID No. 80 


holA 


2333 
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1290847 
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coaD 
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Seq. ID No. 83 
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1 374400 


1373168 


Seq. ID No. 84 
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1 375792 
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Seq. ID No. 86 
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Seq. ID No. 92 
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Seq. ID No. 94 
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holB 
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1684114 


1684102 


1 682330 


Seq. ID No. 106 


dnaX 
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Example 6 

Identification of the gene encoding Coenzyme A (CPA) in Alloiococcus 

onnDis 

10 

Pantothenate kinase (PanK, CoaA) encoded by the coaA gene catalyzes the 
initial step in Coenzyme A (CoA) biosynthesis. CoA is an essentia! co-factor in a 
number of metabolic pathways in bacteria and mammals. Short-chain thioesters 
15 such as acetyl-CoA and succinyl-CoA are essential intermediates in carbon 

metabolism. CoA-thioesters of long chain fatty acids feed into p-oxidation and are 
also the source of fatty acids for phospholipids. In addition, CoA and its thioesters 
play important roles in the regulation of several enzymes in intermediary metabolism, 
including pyruvate dehydrogenase and phosphoenolpyruvate carboxylase. Finally, 



77- 



WO 03/104391 



PCT/US02/36122 



synthesis of holo acyl carrier protein (ACP) is dependent on CoA for the 4'- 
phosphopantetheine moiety linked to ACP. ACP is essentia! for fatty acid 
biosynthesis. The two major acyl-carrier groups in cells: CoA and ACP, are derived 
from pantothenate. Pantothenate can be obtained exogenously through uptake via a 
5 permease, the product of the panF gene. Alternately, pantothenate is the product of 
condensation of pantoate and 0-alanine via pantothenate synthetase, the product of 
the panC gene. The initial step in CoA biosynthesis is the phosphorylation of 
pantothenate by pantothenate kinase (PanK, CoaA). 

The coaA gene was originally identified by Dunn and Sneli in S. typhimurium 
10 as a temperature sensitive allele. Similarly, a temperature sensitive allele of coaA 
was reported for E. coli in 1 987. CoaA was found to be essential in E. coli in a 
recent genetic footprinting analysis. In the temperature sensitive strains, 
accumulation of phosphorylated CoA intermediates rapidly ceased following shift to 
the non-permissive temperature. CoaA was shown to be a homo-dimer of 35 kDa 
15 subunits that bound ATP cooperatively. ATP is bound first in a sequential 

mechanism of action; CoA has been shown to be a potent inhibitor of the reaction 
and competitively competes for binding with ATP. Therefore CoaA is under feedback 
regulation and is the major regulatory step in CoA biosynthesis. 

Lysine 101 in bacterial pantothenate kinase (CoaA) was found to be essential 
20 for both ATP and CoA binding. This supports kinetic data that CoA is a competitive 
inhibitor of ATP binding to CoaA and that both substrates bind to the same site. 

Homologues of E. coli CoaA have been identified in B: subtilis, S. pyogenes, 
M. tuberculosis, H. influenzae and V. cholerae. Homologues have not been identified 
in either the S. cerevisiae genome or in a mammalian expressed sequence tag 
25 database. Calder et al. identified a homologue, through functional complementation 
of an E. coli coaA ts mutant, in A. nidulans. Homologue of this gene identified in 
Alloiococcus otitidis as described in Example 5 (Seq. ID No 47. The protein encoded 
by the gene is set forth in Seq. ID No. 48. 

The A. nidulans gene was then used to identify a yeast homologue. The 
30 bacterial and Aspergillus enzymes were found to be 16% identical and 32% similar. 
Although this level of similarity is quite weak the essential lysine residue involved in 
nucleotide binding appears to be conserved; however, the sequence surrounding the 
lysine residue were not conserved and further study will be required to validate this 
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finding. The most striking difference between the eukaryotic and prokaryotic 
enzymes is found in the sensitivity of each to competitive inhibition by CoA and 
acetyl-CoA. The yeast enzyme was most sensitive to acetyl-CoA and less sensitive 
to CoA, whereas the converse was true for the bacterial enzyme. Later studies 
5 demonstrated that mammalian pantothenate kinase is activated by CoA and inhibited 
by acetyl-CoA. 

Nucleotide binding 

Binding of ATP to CoaA is directly demonstrated by equilibrium dialysis 
10 employing the non-hydrolyzable ATP analogue ATPyS. The K<, measured for ATP 
binding is reported to be 2.1 pM. 

CoA binding 

Binding of CoA to CoaA is directyl demonstrated by equilibrium dialysis and 
15 the Kd is reported to be 6.7 pM. 

Pantothenate kinase activity 

Specific kinase activity of CoaA is demonstrated using D-[1- 14 C]pantothenate and 
capturing 4'-phospho[1 - 14 C]pantothenate on DE81 filters. Using this assay the 
20 following kinetic values were derived: specific activity - 470+/- 200 nmol/min/mg; 
pantothenate - 36 pM; Km ATP - 1 36 pM. 

Suitability of target for anti-infective development 

Coenzyme A biosynthesis is essential for bacterial viability. CoaA catalyzes the 
25 first step of biosynthesis of CoA and appears to be the point of regulation for the 
pathway. The essentiality of CoaA is demonstrated through the construction of 
temperature sensitive alleles in coaA. Although the yeast enzyme is found to- 
functionally complement the bacterial temperature sensitive allele, sequence and 
kinetic differences suggest the possibility of identifying inhibitors of the bacterial 
30 enzyme with high selectivity. As CoaA is essential and conserved in gram-negative 
and gram-positive pathogens, such inhibitors will have broad-spectrum utility. 
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Suitable assays for measuring CoaA function 

CoaA is purified by standard methods using widely available molecular tags 
following expression at high level from E. colL Pantothenate kinase activity is 
measured as follows: CoaA and D-[1- 14 C]pant6thenate is incubated in a buffer 
5 consisting of 100 mM Tris (pH 7.4), 2.5 mM MgCI 2 , 2.5 mM ATP for 5-60 minutes at 
37'C. Product, 4'-phospho[1 - 14 C] pantothenate, is monitored through retention of 
labeled material on DE81 filters. This assay is amenable to high-throughput 
screening using high-density well-filter plates. 

io Example 7 

Identification of the gene encoding CqaBC (Dfp) in Alloiococcu s otttidis 

The E. coli dfp gene, which encodes the previously designated Dfp protein, 
was originally identified as encoding an enzyme required for CoA biosynthesis. The 

15 gene, coding for the protein of interest, was renamed coaBC to reflect the enzyme 
function in CoA biosynthesis. CoA is an essential co-factor in a number of metabolic 
pathways in bacteria and mammals. Short-chain thioesters such as acetyl-CoA and 
succinyl-CoA are essential intermediates in carbon metabolism. 

CoaBC carries out the second and third steps of coenzyme A 

20 biosynthesis: the conjugation of 4'-phosphopantetheate with cysteine by the CoaB 
(PPCS : 4'phosphopantethenoyl cysteine synthase) activity followed by the 
conversion to 4 , -phosphopantetheine by the CoaC (PPCDC: 
4'phosphopantenoylcysteine decarboxylase) activity. Homologue of this gene 
identified in Alloiococcus otitidis as described in Example 5 (Seq. ID No 77). The 

25 protein encoded by the gene is set forth in Seq. ID No. 78. 

Enzyme activity of CoaBC (Dfp): 

Initially it was demonstrated that Dfp enzyme catalyzing oxidative 
30 decarboxylation of (R)-4'-phospho-N-pantothenoylcysteine (PPC) to form 4'- 

phosphopantetheine (PP) - the third step in CoA biosynthesis from pantothenate 
The Km for this reaction is 800 \M for PPC. 

Subsequently, it was established that Dfp is a Afunctional enzyme, catalyzing 
the second step of CoA biosynthesis, coupling of 4'-phosphopantothenate with 
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cysteine to form PPC, as well. This reaction is a two-step process and requires CTP 
for initial 4'-phosphopantothenate activation. Second step couples cysteine to the 
phosphopantothenate moiety with a release of CMP. Estimated Km s are 300 ]M for 
4'-phosphopantothenate and CTP, and 250 jiM for cysteine. 

5 

CoaBC as target for antibacterial development. 

Coenzyme A (CoA) plays a vital role in the metabolism of living cells. 
According to a recent report, 4% of all enzymes in the cell require CoA, its thioesters 

10 or 4'-phosphopantetheine. Recent genetic footprinting experiments on E. coli and 
direct gene knockout have established that this coaBC is essential for bacterial 
growth. Homologs of coaBC have been identified in a number of gram-positive and 
gram-negative organisms, which suggested the possibility of developing a broad- 
spectrum antibacterials from coaBC inhibitors. Considering the bifunctional nature of 

15 CoaBC, it is feasible to identify inhibitors that will inhibit both enzymic functions, thus 
arresting two steps in the CoA pathway. Another important factor in favor of 
selecting CoaBC as a target for antibacterials is low homology of the bacterial 
enzyme to eukaryotic counterparts. In most of the higher organisms including 
humans, two separate enzymes carry out these functions. Moreover, mammalian 

20 (R)-4'-phospho-N-pantothenoylcysteine decarboxylase is a pyruvate-dependent 
enzyme, while CoaBC requires flavine mononucleotide for its function. 

Assays for measuring CoaBC activity. 

25 PPC synthetase activity is be monitored by detecting the released 

pyrophosphate. This is achieved by converting pyrophosphate to inorganic phospate 
with pyrophosphatase and detection by the Malachite Green assay, or by the MESG 
assay spectrophotometrically. CoaBC (2 \ig) is incubated in the reaction buffer 
containing 10 mM DTT, 2 mM MgCI 2 , 50 mM Tris-HCI, pH 8, 300 \M 4'- 

30 phosphopantothenate, 3.5 mM CTP, 5 jj.g pyrophosphatase. The reaction is started 
by addition of appropriate amount (1 0-500 jj.M final) of cysteine. The reaction is 
stopped at different time points by addition of equal volume of 5M H 2 S0 4 . The 
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10 



amount of inorganic phosphate released will be determined according to the one of 
described techniques. 

PPC synthetase activity is also monitored. by detecting the release of carbon 
dioxide from 14 C-Iabeled cysteine. CoaBC (2 ^g) is incubated in the reaction buffer 
containing 10 mM DTT, 2 mM MgCI 2 , 50 mM Tris-HCI, pH 8, 2.5 fxM 4'- 
phosphopantothenate, 3.5 mM CTP. The reaction is started by addition of 
appropriate amount (30 mM, final concentration) of 14 C-labeled cysteine. The 
reaction is stopped at different time points by addition of equal volume of 5M H 2 S0 4 . 
Amount of released 14 C-labeled C0 2 is determined according to published technique. 

Example 8 



identification of the gene encoding phosphopantetheine adenylyltransferase 
15 (CoaD) in AUoiococcus otitidis 

Phosphopantetheine adenylyltransferase, (PPAT, CoaD, KdtB) catalyzes the 
penultimate step in Coenzyme A (CoA) biosynthesis. The fourth step in CoA 

20 biosynthesis is the addition of AMP to ^-phosphopantetheine by phosphopantetheine 
adenylyltransferase (CoaD) to form 3* dephospho-CoA (dPCoA). 

The coaD gene was first identified in E. coli by Geerlof et a/. CoaD is 
essential for viability in E. coli and S. aureus. The enzyme has a mass of 1 8 kDa and 
was determined to be a hexamer through cross-linking studies. Crystallography 

25 confirmed the oligomeric state of the enzyme. Moreover, co-crystallography of CoaD 
with dPCoA has also been carried out mapping the binding pocket for the major 
product of the reaction. Interestingly, in mammals PPAT has been shown to be in a 
complex with dephospho Coenzyme A kinase (dPCoA kinase, DPCK). This enzyme, 
purified from pig liver, is referred to as CoA Synthase. The yeast PPAT is associated 

30 with a protein complex that is in excess of 375 kDa and composed of six proteins. 
There is no detectable homology between the bacterial PPAT (CoaD) and the 
recently identified human PPAT, the activity of which is contained in a bifunctional 
PPAT/DPCK enzyme. Homologues of E. coli CoaD have been identified in P. 
aeruginosa, S. pneumoniae, S. aureus, H. influenzae, H. pylori, B. anthracis and M. 

35 tuberculosis. Homologue of this gene identified in AUoiococcus otitidis as described in 
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Example 5 (Seq. ID No 81). The protein encoded by the gene is set forth in Seq. ID 
No. 82. 

Enzyme activity 

5 CoaD (PPAT) carries out the reversible transfer of AMP to 4'- 

phosphopantetheine, forming dephosphocoenzyme A and releasing PPi. The 
reverse reaction was demonstrated by Geeriof et ai using a coupled assay to tie 
ATP production to NADP reduction, which is monitored at 340 nm. The following 
kinetic constants were calculated: kcat = 3.3 +/- 0.1 /sec; K m <dPcoA) = 7.0 +/- 1 .4 uM; 

10 K m (PPi) = 0.22 +/- 0.04 mM. 

CoaD as target for anti-infective development. 

Coenzyme A biosynthesis is essential for bacterial viability. CoaD, 
15 phosphopantetheine adenylyltransferase, catalyzes the fourth step in the pathway 
and was shown to be essential in both E. coli and S. aureus. There is no measurable 
homology between CoaD and the human PPAT enzyme, so the liability of poorly 
selective compounds is quite low. As CoaD is essential and conserved in gram- 
negative and gram-positive pathogens, inhibitors developed against this target will 
20 have broad-spectrum utility. 

Assays for measuring CoaD function 

CoaD will be expressed and purified using standard methodologies for 
bacterial expression and affinity tag-based purification. Two assay formats can be 

25 used to monitor enzymatic activity: the forward reaction and the reverse reaction. 

The forward reaction assay was initially described for measuring the activity 
of the human PPAT activity in the PPAT/DPCK enzyme. The enzyme assay is 
carried out in 50 mM Tris (pH 8.0), 2 mM MgCI 2 , 5 mM ATP, 5-500 uM 4'- 
phosphopantotheine, 7.5 mM NADH and enzyme (initially 0.1 - 1.0 Mg/ml). The 

30 production of PPi is detected using the protocol of O'Brien in which PPi production is 
coupled to the oxidation of NADH to NAD. This system requires the addition of 4 
enzymes (PPpdependent phosphofructokinase, aldolase, triosephophate isomerase 
and glycerol-3-P dehydrogenase) to the basic reaction mix and presents the added 
issue of deconvolution, which limits the use of the assay as a primary screen. 
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The reverse direction assay is carried out also as a coupled assay to tie ATP 
production to NADP reduction following the method described by Lamprecht & 
Trautschold. The assay is set up in reaction buffer containing the following: 50 mM 
Tris (pH 8.0), 1 mM DTT, 2 mM MgCI 2 , 1 mM NADP, 5 mM glucose, 2 mM PP ti 0.1 
5 mM dPCoA. Hexokinase (4 units) and glucose-6-phosphate dehydrogenase (1 unit) 
will be added to the assay as the coupling enzymes in addition to CoaD (initially 0.1 - 
1 pg/ml). The assay is monitored at 340 nm. Deconvolution of hits is required with 
this assay, however with only 2 additional enzymes the task will be less cumbersome 
when compared to the forward assay described above. 

10 

Example 9 

Identification of the gene encoding pephqsphoCoA ki nase (DPCK, YacE, 

COAE) IN ALLOIOCOCCUS OTITIDIS 

15 

DephosphoCoA kinase (DPCK, YacE, CoaE) encoded by the coaE gene 
catalyzes the final step in Coenzyme A (CoA) biosynthesis. The final step in CoA 
biosynthesis is the phosphorylation of the 3'-hydroxyl group of dephospho-CoA to 
20 form CoA by dephosphocoenzyme A kinase (DPCK, YacE, CoaE). 

The determination that the previously identified yacE gene encoded the 

♦ 

dephosphocoenzyme A kinase activity was reported by Mishra et ai These authors 
previously determined that separate enzymes encode the phosphopantetheine 
adenyltransf erase (PPAT) and dephosphocoenzyme A kinase (DPCK) activity in 

25 Corynebacterium ammoniagenes in contrast to the eukaryotic enzymes in which the 
PPAT and DPCK activities are coupled. The E. col) gene, encoding a 25 kDa 
protein, was cloned based on the sequence of the C. ammoniagenes gene and found 
to be identical to the previously described yacE gene. The gene was designated 
coaE to follow existing nomenclature in E. coll CoaE (YacE) was shown to be 

30 essential in E. coli through genetic footprinting. CoaE is widely distributed in 

bacteria. Homologue of this gene identified in Alloiococcus otitidis as described in 
Example 5 (Seq. ID No 93). The protein encoded by the gene is set forth in Seq. ID 
No. 94. 
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Assays for measuring CoaE function 

CoaE carries out the phosphorylation of dephosphocoenzyme A at the 3' 
hydroxyl group, consuming ATP, to form CoA. Dephosphocoenzyme A kinase 
activity is measured in a coupled reaction in which NADH oxidation to NAD is tied to 

5 ADP production. In this assay, the standard pyruvate kinase/lactose dehydrogenase 
coupling system is used to generate NAD in a 1:1 molar equivalent to the ADP 
produced by the test enzyme. NADH oxidation to NAD is monitored at 340 nm in a 
standard spectrophotometer. The following kinetic values were determined for CoaE: 
Km (atp) = 0.74 mM; K m(de phospho-coA)= 0.14 mM (7). 

10 The formation of CoA is monitored using a coupled enzyme system in which 

acetyl-CoA is formed in proportion to the amount of CoA in the assay. Three 
enzymes (phosphate acetyl transferase, citrate synthase and malate dehydrogenase) 
are added to the reaction that results in the formation of NADH from NAD, which is 
monitored at 340 nm. 

15 

CoaE as a target for anti-infective development 

Coenzyme A biosynthesis is essential for bacterial viability. CoaE, 
dephosphocoenzyme A kinase, catalyzes the final step in CoA synthesis and is 
shown to be essential by genetic footprinting in E. coli. A degree of homology 

20 between CoaE and the human DPCK enzyme has been noted, such that selectivity 
assays is necessary to determine a high therapeutic index for CoaE inhibitory 
compounds. CoaE is conserved in gram-negative and gram-positive pathogens and 
should have broad-spectrum utility in the clinic. 

CoaE is expressed and purified using standard methodologies for bacterial 

25 expression and affinity tag-based purification. DephosphocoA kinase activity is 
monitored using a coupled enzyme system to tie ADP production to oxidation of 
NADH to NAD. The decay of absorbance at 340 nm will be the assay readout. The 
assay will be setup in the following buffer: 50 mM Tris (pH 8.5), 20 mM KCI, 10 mM 
MgCI 2 , 10 mM ATP, 0.3 mM NADH and 0.4 mM phosphoenoipyruvate. The coupling 

30 enzymes: pyruvate kinase (10 U) and lactate dehydrogenase (4 U) will be added 

along with dephosphocoenzyme A kinase (initially 0.1- 1 .0 ug/ml). The assay will be 
started by the addition of 0.4 mM dephosphocoenzyme A. In this assay system, the 
release of ADP is tied to the oxidation of NADH to NAD, and is monitored at 340 nm. 
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This assay is transferable to a high-density microtiter plate format and suitable for 
HTS. 

Example 10 

5 Identification of dnaB and pcrA, genes encoding heli cases in Alloiococcus 

Helicases unwind double-stranded DNA in a reaction that couple nucleotide 
binding and hydrolysis to strand unwinding. Their activity is required for a number of 
10 biological processes such as separation of the chromosome during replication, 

recombination and repair. Homologue of thse genes were identified in Alloiococcus 
otitidis as described in Example 5 (Seq. ID No 1 5 and 99). The protein encoded by 
the gene is set forth in Seq. ID No. 16 and 100. 

15 Due to the essential roles modulated by these molecules they represent an 

important target for antibacterial therapy. Homologs of dnaB and pcrA genes 
encoding helicases were identified as described in Example 5. A primary assay, 
which detects helicase function in vitro, is used to identify inhibitors of each enzyme 
and is described below. 

20 Genes encoding DnaB and PcrA is obtained using polymerase chain reaction 

amplification of the genomic region encoding them. The genes is subcioned into a 
standard expression vector either containing an amino acid tag for ease of 
purification or not. The enzyme is then over-expressed in Escherichia coli and 
purified using a standard tag system. 

25 Most helicases require a region of single-stranded DNA flanking the duplex 

region that it unwinds. As a result, providing a single stranded region to either the 3' 
or 5' end of a duplex allows for determination of the polarity of helicase unwinding. 
These types of experiments have demonstrated that PcrA and DnaB are 3'-5' and 5'- 

4 

3' helicases, respectively. None the less, a convenient filtration assay has previously 
30 been described that is formatted for high-through-put screening of inhibitors of either 
enzyme, regardless of polarity. Assays (90 ul) contained 15 pM single-stranded M13 
DNA to which a radiolabeled oligonucleotide had been annealed as a substrate for 
unwinding. Reactions are carried out in 96-weli GF/C unifilter hydrophobic plates 
(Polyfiltronics Inc.) in 70 ul helicase buffer [20 mM Hepes (pH 7.6), 4 mM MgCI 2 4 



86- 



WO 03/104391 



PCT/US02/36122 



mM ATP, 1 00 ug/ml BSA, 5% glycerol and 2 mM DTT] and 1 0 ul of DMSO or 
compound. Reactions are initiated by adding 10 ul of purified helicase protein and 
are incubated for 1 hr at room temperature. 100 ul of 2X capture buffer containing 
silica beads [25% methanol, 3 M Nal, 0.03% NP-40, and 10% GlassFog beads 
(BIO101)] were added. The mixture was incubated for 30 min at room temperature. 
Plates are then washed 5X on a Bio-Teck instruments, Auto Washer EL403) with 
wash buffer (50% ethanol, 0.2% NP-40 and 50 mM NaCI). Scintillation fluid was 
added and plates are counted (Packard Topcount). 



10 EXAMPLE 1 1 




DnaE is an enzyme that catalyzes the DNA template directed polymerization 
15 of deoxyribonucleotides into deoxyribonucleic acid. The enzyme has been reported 
to modulate lagging strand synthesis at gram-positive replication forks. Functions for 
DnaE have been defined biochemically, in Bacillus subtilis and Streptococcus 
pyogenes. Homologue of this gene identified in Alloiococcus otitidis as described in 
Example 5 (Seq. ID No 75). The protein encoded by the gene is set forth in Seq. ID 
20 No. 76. 

Because DnaE is an essential protein in gram-positive bacteria and has high 
homology to the gram-negative dnaE, which is an essential polymerase subunit of 
the DNA polymerase III holoenzyme, it serves as a good target for antibacterial drug 
discovery. A primary assay, which detects processive DnaE mediated DNA 
25 synthesis in vitro, is useful identify inhibitors of the enzyme and is described below. 

The gene encoding DnaE I in Alloiococcus otitidis was identified as described 
in Example 5. Purification of DnaE DNA polymerase from Alloiococcus. The gene 
encoding DnaE is obtained using polymerase chain reaction amplification of the 
dnaE gene. The gene is subcloned into a standard expression vector either 
30 containing an amino acid tag for ease of purification or not. The enzyme is then 
over-expressed in Escherichia coli and purified using a standard tag system. 

Because DnaE catalyzes the incorporation of single deoxyribonucleotides into 
DNA, the incorporation of radiolabeled deoxyribonucleotides into larger 
deoxyribonucleic acid molecules is monitored to measure activity of the enzyme. A 
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filtration assay has been previously described for Streptococcus pyogenes DnaE that 
uses filterplates containing DE81 filters to capture polymerized DNA. This assay is 
amenable to high-through-put screening format for DnaE. Assays contained 70 ng of 
30-mer primed M13mp1 8 single stranded DNA as a template for replication. The 

5 reaction contained 3.3-300 ng of DnaE in 23.5 pi of replication buffer [20 mM Tris- 
HCL (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCI 2 , 40 
ug/ml BSA] and 60 pM of both dGTP and dCTP. NaCI was added to the reaction 
mixture to a final concentration of 40 mM. DNA synthesis was initiated by the 
addition of 1 .5 pi of 1 .5 mM dATP and 0.5 mM [p- 32 P]dTTP. Reactions were 

10 incubated at 37°C for various lengths of time and were quenched by adding an equal 
volume of 1% SDS and 40 mM EDTA. One-half of the terminated reaction was 
applied to DE81 filter paper and washed 3X with wash solution (0.3 M Ammonium 
formate and 0.01 M Sodium pyrophosphate). Filters were then placed in scintillation 
vials and 1 ml scintillation counting liquid was added. Radioactivity was counted 

15 using a scintillation counter. 

EXAMPLE 12 

InPMTIF.CATION OF DNAG. THE GENE FNCODING PRIM A SE IN ALLOIOCOCCUS OTITIDIS 

20 DnaG is an enzyme that catalyzes the DNA template directed polymerization 

of ribonucleotides into ribonucleic acid cfe novo . Ribonucleic acid molecules that are 
synthesized by DnaG primase subsequently serve as primers for synthesis of the 
leading- and lagging-strands during chromosomal replication. Functions for DnaG 
have been defined biochemically, and the crystal structure of the RNA polymerase 

25 domain has been determined in Escherichia coli. Homologue of this gene identified in 
Alloiococcus otitidis as described in Example 5 (Seq. ID No 63). The protein encoded 
by the gene is set forth in Seq. ID No. 64. 

Because DnaG primase plays an essential role in both leading- and lagging- 
strand synthesis during chromosomal replication, and DnaG has homologs in all 

30 prokaryotes but not eukaryotes, it serves as a good target for antibacterial drug 
discovery. A primary assay, which detects DnaG mediated RNA synthesis in vitro, 
can be used to identify inhibitors of the enzyme and is described below. 
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Assay for the activity of DNA polymerase and identification of compounds that 
inhibit DnaG 

The gene encoding DnaG is obtained using polymerase chain reaction 
5 amplification of the dnaG gene. The gene is subcloned into a standard expression 
vector either containing an amino acid tag for ease of purification or not. The 
enzyme is then over-expressed in Escherichia coli and purified using a standard tag 
system. 

Because DnaG catalyzes the incorporation of single ribonucleotides into RNA, 
10 the incorporation of radiolabelled ribonucleotides into larger ribonucleic acid 
molecules is monitored to measure activity of the enzyme. A high-throughput 
scintillation proximity assay (SPA) assay, previously described for E. coli DnaG, is 
used to meadure activity of DnaG activity in a coupled reaction with DnaB helicase. 
The assay, which was shown to work with DnaG alone, is used to screen for 
15 compounds that inhibit DnaG function. Assays are run in 96-weli Packard Optiplate 
plates. First, 1 pi DMSO or test compound was added, followed by 20 pi of DnaG 
(208 nM) and 3.3 nM M13mp1 8 single-stranded DNA. Reactions are initiated by 
adding 10 ul of primase assay buffer [50 mM Tris-HCI (pH 7.5), 4% sucrose, 8 mM 
DTT, 5 mM MgCI 2 , 40 ug/ml BSA, 0.1 pg/ul Rifampicin, 25 U/ml RNA guard, 100 
20 GTP, 100 pM UTP, 3 pM CTP, 1 mM ATP] and 0.4 pCi [ 3 H]CTP. Reactions are 
incubated at 30°C for 30 min. Next, a suspension of 50 pi of 2.5 mg/ml PVT-PEI 
SPA beads (Amersham; prepared in 0.3 M NaCitrate, pH 3.0) were added. Plates 
were read after 1 hr on a Topcount instrument (Packard). 



25 



30 



35 



Example 13 

DmaN. DnaX. HolA, HolB. and PolC. the ge nes encoding the subunits of 

ALLOIOCOCCUS OT777P/SDNA POLYMERASE III HOLOENZYM E: BETA (B>. TAU (T), DELTA 

(A\ DELTA' (A') AND POLC- 

DNA polymerase III holoenzyme is an enzyme complex comprised of multiple 
highly conserved subunits that catalyzes the DNA template directed polymerization of 
deoxyribonucleotides into deoxyribonucleic acid. In gram positive organisms the 
holoenzyme is composed of a polymerase subunit, PolC, and accessory proteins. 
The accessory proteins act in a coordinated manner to clamp the polymerase tightly 
to the DNA template allowing the polymerase to synthesize DNA with high speed and 

-89- 



WO 03/104391 



PCTAJS02/36122 



processivity. Homologue of these genes identified in Alloiococcus otitidis are 
described in Example 5 (Seq. ID Nos. 21, 105, 79, 103, and 105 respectively). The 
protein encoded by the gene is set forth in Seq. ID No. 22, 106, 80, 104 and 106 
respectively). 

5 Functions for the individual subunits have been defined biochemically and 

interactions between them have now been deduced structurally by crystallographic 
analysis of the enzyme from Escherichia coli. Tau interacts directly with both delta 
and delta' to form a clamp loader complex. Upon binding ATP the complex 
undergoes a conformational change altering an interaction between delta and delta', 

10 which allows delta to subsequently interact with the beta-clamp. The beta-clamp is a 
ring-shaped. homomultimer assembly that can be opened by delta and placed onto a 
primed DNA template. ATP hydrolysis results in closing the clamp around DNA and 
dissociation of the clamp-loading complex. PolC then couples with the beta clamp to 
form a highly processive polymerase. 

15 Because DNA polymerase 111 holoenzyme is comprised of multiple subunits, 

the opportunity exists to inhibit its activity at a number of different sites. A primary 
assay, which detects processive DNA synthesis in vitro, can be used to identify 
inhibitors of the enzyme and is described below. Deconvolution of inhibitors, based 
on either activity of physical interaction, follow the primary assay. 

20 

Assay for the activity of DNA polymerase 

Purification of DNA polymerase ill holoenzyme subunits from Alloiococcus. 
Genes encoding the subunits of DNA polymerase is obtained using polymerase 
chain reaction (PCR) amplification of the genomic region encoding them. The genes 

25 are subcloned into a standard expression vector either containing an amino acid tag 
for ease of purification or not. The enzyme is then over-expressed in Escherichia coli 
and purified using a standard tag system. 

Because DNA polymerase III catalyzes the incorporation of single 
deoxyribonucleotides into DNA, the incorporation of radiolabeled deoxynucleotides 

30 into larger deoxyribonucleic acid molecules is monitored to measure activity of the 
enzyme. A filtration assay is previously described for Streptococcus pyogenes DNA 
polymerase III that uses filterplates containing DE81 filters to capture polymerized 
DNA (2). This assay is amenable to high-through-put screening format. Assays 
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contained 70 ng of 30-mer primed M1 3mp1 8 single stranded DNA as a template for 
replication. The reaction contained 43 ng of B and 140 ng of PoIC-taa' complex in 
23.5 Ml of replication buffer (20 mM Tris-HCL (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 
mM DTT, 2 mM ATP, 8 mM MgCl 2 , 40 ug/ml BSA, and 60 uM of both dGTP and 

5 dCTP. DNA synthesis was initiated by the addition of 1 .5 ul of dATP and [ul- 

32 P]dTTP. Reactions were incubated at 37°C for various lengths of time and were 
quenched by adding an equal volume of 1 % SDS and 40 mM EDTA. One-half of the 
terminated reaction was applied to DE81 filter paper and washed 3X with wash 
solution (0.3 M Ammonium formate and 0.01 M Sodium pyrophosphate). Filters were 

10 then placed in scintillation vials and 1 ml scintillation counting liquid was added. 
Radioactivity was counted using a scintillation counter. 

Compounds inhibiting PolC subunit is identified by modifying the above reaction 
to include only the PolC subunit and using 2.5 pg activated calf thymus DNA as a 
substrate, instead of singly-primed M13mp18 DNA, as previously described. 

15 Several techniques are utilized to determine the interaction of inhibitors with 

individual subunits. These have been described in the literature and include the 
following: (1) Nuclear magnetic resonance and capillary electrophoresis. 

20 

Example 14 
Era GTPase in Alloiococcus otitidis 

The era (E. coli Ras) gene was initially identified while sequencing around the 
25 mc gene; era lies downstream of rnc. While a function for era has yet to be 

determined, conditional (temperature sensitive) mutants revealed that the product of 
the era gene, Era, is essential for E. coli viability. A hint as to an in vivo function for 
Era was uncovered when a suppressor of a dnaG (primase) allele was found to map 
in the era coding sequence and a second suppressor, which mapped upstream of the 
30 era open reading frame, affected expression of era. These data suggest that Era 
could play one or more roles in DNA replication, regulation of primase activity or 
otherwise effect ceil cycle progression. More recent data has confirmed that the era 1 
mutant causes a defect in cell growth at the two-cell stage and delays cell division 
Moreover, Britton et al demonstrated that cell division was coupled with the level of 
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Era in the cell: division arrest, through reduction in Era levels, is reversed when Era 
levels return to threshold amount. A current model suggests that Era acts as a 
checkpoint regulator in the bacterial cell cycle. Era is a GTP-binding protein with 
GTPase activity, a threshold level of functional/activated Era may be required to 

5 initiate septation. 

Era is associated with additional cellular functions, specifically translation, as 

Era specifically interacts with the translation machinery. E. coli Era binds both 1 6S 
rRNA and the 30S ribosomal subunit; whereas, the S. pneumoniae 16S rRNA co- 
purifies with Era. A putative RNA binding "KH motif" has been identified in the 
10 carboxyl-terminal domain. The RNA binding activity is critical to Era cellular function 
as mutation of the putative RNA binding region of the S. pneumoniae Era prevents 
complementation of an E. coli era mutant strain. Homologue of this gene identified in 
Alloiococcus otitidis as described in Example 5 (Seq. ID No 65). The protein encoded 
by the gene is set forth in Seq. ID No. 66. 

15 

Nucleotide binding 

Filter-binding assays are utilized to demonstrate nucleotide-binding specific to 
GTP and not UTP, CTP or ATP. Both GTP and GDP (unlabeled) were capable of 
inhibiting a 32 P-GTP binding. The Kd for GTP and GDP binding were reported to be 

20 5.5 and 1 .0 pM, respectively. 

A large number of GTP-binding proteins have been studied and all members 
of the family contain three regions of highly homologous amino acid residues that 
define a GTP-binding pocket. Era contains well-conserved regions defining the so- 
called G1 (G/AXXXXGKT/S: residues 15-22), G3 (DXXG: residues 62-65) and G4 

25 (NKXD: residues 124-128) consensus sequences. The G2 domain (residues 33-38, 
see below), located between G1 and G3, is generally more variable. 

GTPase activity 

Purified Era showed a significant GTPase activity, which is inhibitable by GTP 
30 or GDP but not by UTP, CTP, ATP or ADP. The maximum hydrolysis rate is 

measured at 9.8 mmol GTP hydroiyzed/min/mol Era. The Km was found to be 9 pM. 

It should be noted that Sullivan et al demonstrated, using mant (A/-methyl-3'- 
O-anthraniloyl) labeled GTP and GDP, very rapid exchange kinetics for guanine 



-92- 



WO 03/104391 



PCT/US02/36122 



nucleotide binding. Era exchanges guanine nucleotides 10-fold more rapidly than the 
GTP hydrolysis rate suggesting that guanine nucleotide binding and release should 
be considered as a regulatory point in addition to the more well-studied hydrolysis 
step. 

5 

Autophosphorylation 

When y^P-GTP is used as a substrate for the GTPase activity , Era is 
phosphorylated. The autophosphorylation reaction is specific for GTP, as incubation 
with y^P-ATP did not result in phosphorylation of Era. Moreover, a 32 P-GTP is not a 

10 suitable substrate for detection of Era autophosphorylation. Tryptic digestion and 
HPLC were utilized to resolve the sites(s) of phosphorylation. Using y^P-GTP as a 
substrate the major radioactive peak contained the tryptic peptide, ISITSR, 
corresponding to Era residues 33-38 and containing 3 potential phosphorylation 
sites. Mutagenesis of both Thr-36 and Ser-37 to alanine abolished enzymatic 

15 activity. However, individual alanine substitutions at either site had no effect on Era 
function. The autophosphorylation site is located in the so-called G2 domain of Era. 

Suitability of target for anti-infective development 

Era is an essential protein for bacterial viability. Knock-down mutations as well 
20 as conditional-lethal alleles revealed that Era function is required for cytokinesis. An 
additional phenotype of the Era-depleted strains is an aberrant response to 
temperature induced stress. This target is novel and may well lead to the 
identification of new classes of anti-infectives. The widespread distribution of Era 
homologues in both gram-negative and gram-positive pathogens suggests that 
25 broad-spectrum agents could result from an effort to define Era inhibitory 
compounds. 

Assays for measuring Era function 

30 Nucleotide binding Assays 

Era binding to nucleotide is monitored by a simple filter-binding assay. Era 
(1-5 pg) is incubated with a^P-GTP (0.2 pCi) in a buffer consisting of 100 mM Tris 
(pH 7.5), 10 mM MgCI 2 , 0.2% NP-40, 0.2 mg/ml BSA for 30 minutes at 32°C. A 
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portion of the reaction mix is spotted on nitrocellulose membrane, washed (50 mM 
Tris (pH 7.5), 5 mM MgCI 2 , 1 mM DTT) and dried. The membrane is then exposed to 
X-ray film. Alternatively, the spots are excised and counted. This assay is directly 
amenable to HTS using filter plates. 

5 

GTPase activity Assay 

The GTP hydrolytic activity of Era is monitored using thin-layer 
chromatography. Era and a^P-GTP is incubated in 50 mM Tris (pH 7.5), 5 mM 
MgCI2, 0.1 % NP-40, 0.2 mg/ml BSA for 30 minutes at 37°C. An aliquot of the 

10 reaction is placed on PEI cellulose and the strip developed with 0.5 M KH 2 P0 4 , 1 .0 M 
NaCI (pH 3.7). The spots conforming to GDP and GTP are identified by UV 
shadowing, excised and counted. This assay represents an acceptable 
secondary/confirmatory assay. 

Alternatively, the hydrolysis of y^P-GTP is monitored by assaying for 

15 liberated Pi. Obg and a 32 P-GTP is incubated in 50 mM Tris (pH 8.5), 1 .5 mM MgCI 2 , 
0.1 mM EDTA, 100 mM KCI, 10% glycerol for 30 minutes to 3 hours at 37°C. The 
reaction will be stopped by the addition of a slurry of charcoal in 1 mM Kpi (pH 7.5), 
which selectively binds the GTP and GDP. The liberated P-, in the supernatant is 
monitored by Cerenkov counting. Free Pi can also be monitored with the Malachite 

20 Green reagent. 

Auto phosphorylation Assay 

Era autophosphorylation is monitored by incubating Era with Y 32 P-GTP in 50 mM 
morpholinopropane sulphate (pH 6.8), 5 mM MgCI2, 1 mM DTT at 37°C (14).' 
25 Samples are analyzed following separation on SDS polyacrylamide gels, drying the 
gel and exposure to film. This assay represents an acceptable 
secondary/confirmatory assay for Era activity. 
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EXAMPLE 15 

FmhB(FemX) Genes in Alloiococcus Otitidis 

5 The femA, femB, and fmhB(femX) genes have been shown to be essential for 

incorporation of glycine into the side chain of peptidoglycan precursors in 
Staphylococcus aureus,. The femAB locus was initially identified as a factor essential 
for methicillin resistance {fern) based on random insertional inactivation of 
chromosomal genes and a screen for reduced expression of resistance mediated by 
10 the penicillin binding protein 2A (PBP2A). Inactivation of femA or femB was 

subsequently reported to prevent incorporation of glycine residues at positions 2 to 5 
or positions 4 to 5 of the penta-glycine cross bridge since neuropeptides cross-linked 
by one or three glycine residues were detected in the corresponding mutants. 
Inactivation of fmhB, formerly femX, is lethal, but the construction of a mutant 
15 conditionally expressing fmhB under the control of a xylose-inducible promoter 
showed that the gene was essential for synthesis of branched peptidoglycan 
precursors . These studies show that the fern gene products were required for 
incorporation of glycine at positions 1 (FmhB), 2 and 3 (FemA), and 4 and 5 (FemB) 
of the cross bridge, although the catalytic activity of the proteins has not been directly 
20 assessed. Similarly, inactivation of two fmhB homoiogues in Streptococcus 

pneumoniae, designated murM(fibA) and murN{fibB), reduced addition of L-Ala or L- 
Ser to the -amino group of L-Lys and subsequent addition of a second L-Ala residue, 
respectively. Overall, disruption of the murMN operon reduced the proportion of 
branched peptide stems in the peptidoglycan from 89 to 33% . In contrast to what 
25 occurs in S. aureus, direct cross-linking of L-Lys to D-Ala occurs in S. pneumoniae, 
and the murMN operon was accordingly reported to be unessential. 

BLAST analysis of Alloiococcus otitis genome revealed an ORF similar to 
femXoi Weissella viridescent , and fmhB of S. aureus. It suggests that in 
Alloiococcus otitis there is an enzyme with similar to FhmB function. Homologue of 
30 this gene identified in Alloiococcus otitidis is described in Example 5 /Table 4 (Seq. 
ID No 97). The protein encoded by the gene is set forth in Seq. ID No. 98. 
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Assays for measuring FmhB function 

There are no in vitro biochemical assays to test enzymatic activity of S. 
aureus FmhB because the reaction occurs at the membrane-bound lipid II precursor 
GlcNAc-(|}-1 ,4)- A/- acetylmuramic acid(-L-Ala-D-iGln-L-Lys-D-Ala-D-Ala)- 
5 pyrophosphoryl-undecaprenol. 

Lipid II is a minor component of bacterial cell membrane which is detected by 
thin-layer chromatography separation of presolubilized membranes supplied with the 
cytoplasmic precursors, UDP-/V-acetylmuramyl-pentapeptide (UDP-MurNAc- 
pentapeptide) and [ 14 C]UDP-N-acetylglucosamine ([ 14 C]UDP-GlcNAc). 
10 The in vitro biosynthesis of branched lipid II of S. aureus requires whole-cell 

membranes, cytoplasmic PG precursors, glycine ( 14 C labeled for detection of reaction 
products), purified tRNA, and an intracellular fraction that contains tRNA-activating 
enzymes. Therefore, the in vitro assay of S. aureus FmhB is a tedious procedure. 
One way to facilitate this procedure is to use Weisselia viridescensFemX or 
15 E. faecalis UDP-MurNac-pentapetide:L-alanine ligase. Recombinant Weisseiia 
viridescensFemX and E. faecalis UDP-MurNac-pentapetide:L-alanine ligase were 
purified, and their in vitro activity was demonstrated. The distinctive feature of these 
enzymes is that they catalyze the addition of a branching amino acid (Ala) to the 
cytoplasmic cell wall precursor UDP-MurNac-pentapetide. 
20 Other bacteria for which the biosynthesis of Gly-containing branched UDP- 

MurNac-hexapeptide in cytoplasm was shown are Streptomyces lividans and 
Streptomyces hydroscopicus , although the enzymes were not isolated and their 
ligase activity remain to be demonstrated. 

These new data open an opportunity to develop an assay to detect the 
25 activity of FmhB(FemX) by using cytoplasmic UDP-MurNac-pentapetide. 

Products of the reaction are detected by HPLC. HPLC separation of precursors are 
performed by the method of Flouret et al. The precursors are separated by reverse- 
phase HPLC on a //Bondapak C 18 column (3.9 by 300 mm; Waters) in 50 mM 
ammonium formate (pH 3.9) at a flow rate of 0.5 ml/min. The elution of precursors is 
30 monitored at a wavelength of 254 nm. 
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Example 16 
FolA- Dihydrofolate reductase (DHFR) 

5 The Alloiococcus ORF-1 863 encodes a homolog of S. aureus dihydrofolate 

reductase that catalyzes the NADPH-dependent conversion of dihydrofolate to 
tetrahydrofolate, one of the steps in bacterial folate biosynthesis. Homologue of this 
gene identified in Alloiococcus otitidis is described in Example 5>Table 4 (Seq. ID No 
55). The protein encoded by the gene is set forth in Seq. ID No. 56. 

10 

FolA as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in bacteria, 
such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 

15 therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (PABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Bacterial DHFRs are essential for viability and 
well conserved across all bacterial species. Although bacterial DHFR shares 

20 similarity with human DHFR, selective inhibitors against bacterial DHFR have been 
identified in the past such as trimethoprim which specifically blocks the bacterial 
DHFR step. Thus DHFR still remains an attractive target for development of broad- 
spectrum antibacterial agents. 

25 Assays for measuring DHFR activity 

DHFR activity is monitored spectrophotometrically, recording the change of 
absorbance at 340 nm due to the equimolar consumption of NADPH in the course of 
dihydrofolate substrate reduction. DHFR (10 ng) is preincubated in reaction buffer 
containing 50 mM 2-(N-morpholino)ethanesulfonic acid, 25 mM Tris-HCI, 25 mM 

30 ethanolamine, and 1 00 mM NaCI at pH 7.5 for 3 minutes. The reaction is started by 
addition of 0.5-10 jxM 7,8-dihydrofolate. The amount of processed substrate is 
calculated from the decrease of absorbance at 340 nm due to oxidation of NADPH 
(□=1 1 800 M" 1 cnrf 1 ) to NADP+. 
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Example 17 

FolB- Pihyproneopterin aldolase (DHNA) 

5 The Alloiococcus otitidis ORF-959 encodes a homolog of S. aureus 

dihydroneopterin aldolase that catalyzes the conversion of 7,8-dihydroneopterin to 6- 
hydroxymethyl-7,8-dihydropterin f one of the early steps in bacterial folate 
biosynthesis. Homologue of this gene identified in Alloiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 31): The protein encoded by the gene is set forth in 

10 Seq. ID No. 32. 

FolB as a target for anti-Infective development 

Folate is an essential cofactor in many important metabolic processes in 
bacteria, such as purine, pyrimidine, amino acid and pantothenate biosynthesis. 

15 Unlike mammalian cells, bacteria are unable to utilize exogenous folate derivatives, 
and therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via 
two converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 

20 biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolB have no direct homologs in mammals. 
Thus FolB becomes an attractive target for development of broad-spectrum 
antibacterial agents. 

25 Assays for measuring FolB activity 

FolB (DHNA) 7,8-dihydroneopterin aldolase activity is monitored individually 
or in conjunction with downstream enzymes in folic acid biosynthesis pathway (FolK 
and Sul). 

FolB activity is monitored directly by HPLC assay. FolB substrate (7,8- 
30 dihydro-D-neopterin) is commercially available from Schircks Laboratories 

(Swizerland). FolB (0.5 \iq) is preincubated in reaction buffer containing 50 mM Tris- 
HCI (pH 8.0), 50 mM KCI, 0.1 mg/ml BSA, 2.5 mM dithiothrietol for 5 min. Reaction 
is started by addition of stock solution of 7,8-dihydro-D-neopterin in DMSO (100 fxM 
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final concentration). Reaction is terminated by addition of 1/3 of reaction volume of 
1% l 2 , 2% Kl in 1M HCI with subsequent incubation at room temperature for 5 
minutes. Quenched reaction will be applied directly to HPLC. Oxidized starting 
material and reaction products are efficiently separated on ODS (C18) column. 

5 Reaction components are detected and quantified by analysis of UV absorbance at 
254 nm, or fluorescence (excitation at 365 nm; emission at 446 nm). 

FolB activity are also monitored in the coupled assay with FolK (HPPK) and Sul 
(DHPS) enzymes. FolB activity is measured by detection of radioactive 
dihydropteroate formation as described in FolK and Sul assays, under conditions of 

10 excess of the later enzymes. FolB enzyme and substrate 7,8-dihydro-D-neopterin 
are added to the described assay to replace the 6-hydroxymethyl-7,8<lihydropterin 
(FolK substrate). 

Example 18 

15 FolC- Dihydrofolate synthase (PHFS) 



The Alloiococcus otitidis ORF-956 and ORF-528 both encode a homolog of B. 
subtilis dihydrofolate synthase that catalyzes the conversion of 7,8-dihydropteroate 
and glutamate to dihydrofolate, one of the steps in bacterial folate biosynthesis [. 
20 Homologue of this gene identified in Alloiococcus otitidis as described in Example 5 
(Seq. ID Nos. 29 and 23). The protein encoded by the gene is set forth in Seq. ID 
Nos. 30 and 24. 



Use of FolC as a target for anti-infective development 

25 Folate is an essential cofactor in many important metabolic processes in bacteria, 

such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 
therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 

30 pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 
biosynthesis pathway are essential, and are well conserved across all bacterial 
species. Bacterial FolC appears to be a bifunctional enzyme possessing both 
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dihydrofolate synthase (DHFS) activity and folyl-poiyglutamate synthetase (FPGS) 
activity, which are probably mediated through different sites of the protein. The 
bacterial DHFS activity but not the FPGS activity is essential for viability. Although 
bacterial FolC shares similarity with human FPGS, the human enzymes apparently 
5 lack DHFS activity and display a folate substrate specificity quite distinct from that of 
bacterial enzymes. Thus targeting bacterial FolC/DHFS activity selectively might lead 
to identification of broad-spectrum antibacterial agents. 

Assays for measuring FolC activity 
10 FolC (DHFS) 7,8-dihydrofolate synthase activity in the presence or absence 

of antimicrobial compounds or putative inhibitory compounds are monitored by 
several methods. 

In one method, FolC activity is monitored directly by simple HPLC assay. 
FolC substrate (7,8-dihydropteroic acid) is commercially available form Schircks 

15 Laboratories (Switzerland). FolC (15 ng) is added to reaction mix, containing 10 mM 
glutamate, 5 mM ATP, 50 mM Tris-HCI (pH 8.0), 20 mM Mg 2 CI, 50 mM KCl, 0.1 
mg/ml BSA, 5 mM dithiothreitol. Reaction is started by addition of stock solution of 
7,8-dihydropteroic acid in DMSO (10 [iM final concentration). Reaction is terminated 
by addition of equal volume of 8M Guanidinium hydrochloride. Stopped reaction is 

20 applied directly to HPLC. Starting material and reaction products are efficiently 

separated on ODS (C1 8) column. Reaction components are detected and quantified 
by analysis of UV absorbance at 254 nm, or fluorescence (excitation at 280 nm; 

emission at 420 nm). 

In another method, the FolC activity monitoring is by detection of ADP 

25 accumulation. ADP is released in the amount equimolar to the amount of the product 
formed. ADP detection is performed by coupling its conversion to ATP by pyruvate 
kinase in the presence of phospho(enol)pyruvate producing pyruvate. Lactate 
dehydrogenase reduces pyruvate to S-lactate in the presence of NADH. Course of 
reaction is monitored by decrease in absorbance at 340 nm due to oxidation of 

30 NADH (£=6220 cm' 1 M" 1 ) to NAD + . Reaction conditions are as following: 5 mM 
dithiothreitol, 5 mM ATP, 380 jxM NADH, 10 mM glutamate, 2 mM 
phospho(enol)pyruvate, 50 mM KCl, 20 mM Mg 2 CI, 50 mM Tris-HCI, 50 jig of 
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pyruvate kinase, 50 \ig of S-lactate dehydrogenase. Reaction is started by addition 
7,8-dihydropteroic acid in DMSO (10 m-M final concentration). 

In yet another method, FolC activity is monitored through detection of 
inorganic phospate release. Amount of inorganic phosphate in solution is quantified 
5 by: 

(i) its conversion by purinenucleoside phosphorylase leading to 
phosphorylation of MESG. Later assay kit is available from Molecular Probes 
as EnzCheck™ Phosphate Assay Kit; 

(ii) its reaction with Malachite Green reagent; and 

10 (iii) detecting the release of radioactive inorganic phosphate in reaction with -y- 

33 P-labeled ATP following the absorption of unprocessed ATP by charcoal. 

First method is applied in rate-based assay format; the later two in 
end-point format Reaction conditions are similar to the ones described in 
HPLC-based assay. 

15 

Example 19 

FOLK- 6-HYDROXYM ETHYL-7. 8-D1HYDBOPTER1N PYROPHOSPHOK INASE f HPPK) 

The Alloiococcus otitidis OFR-961 (Seq. ID No. 33) encodes a homolog of S. 
20 aureus 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase that catalyzes 

pyrophosphoryl transfer from ATP to 6-hydroxymethyl-7,8-dihydropterin, one of the 
early steps in bacterial folate biosynthesis. The protein encoded by this ORF is set 
forth in Seq. ID No. 34. (see Example 5/Table 4). 

25 Use of FolK as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in 
bacteria, such as purine, pyrimidine, amino acid and pantothenate biosynthesis. 
Unlike mammalian cells, bacteria are unable to utilize exogenous folate derivatives, 
and therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via 
30 two converging pathways, the non-essential para-ami no-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 
attached to form the folate precursor. Enzymes that catalyze steps in the folate 
biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolK have no direct homologs in mammals. 
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Thus FolK is an attractive target for the development of broad-spectrum antibacterial 
agents. 

Assays for measuring FolK activity 

5 

FolK (HPPK) 7,8-dihydroxymethylpterin-pyrophosphokinase activity is 
monitored individually or in conjunction with downstream enzyme in folic acid 
biosynthesis pathway. 

FolK activity is monitored directly by HPLC assay. FolK substrate (7,8- 

10 dihydro-6-hydroxymethylpterin) is commercially available from Schircks Laboratories 
(Swizeriand). FolK is preincubated in reaction buffer containing 50 mM Tris-HCI (pH 
8.0), 50 mM KCI, 20 mM MgCI 2 , 5 mM ATP, 0.1 mg/ml BSA, 2.5 mM dithiothrietol. 
Reaction is started by addition of stock solution of 7,8-dihydro-6-hydroxymethylpterin 
in DMSO (100 jiM final concentration). Reaction is terminated by addition of equal 

15 volume of 8M Guanidinium hydrochloride and applied directly on HPLC. Starting 
material and reaction products are efficiently separated on ODS (C1 8) column. 
Reaction components are detected and quantified by analysis of UV absorbance at 
254 nm. 

FolK activity is monitored by end-point assay coupled with excess of Sul enzyme. 
20 Activity is calculated from quantification of the radioactivity incorporated in final 
product (7,8-dihydropteroate). 

Example 20 

ALLOIOCOCCUS OTITIDIS ENCODED FOLP (SUL>- DlHYDROPTEROATE S YNTHASE (DHPS) 

25 

The Alloiococcus otitidis ORF-181 1 (Seq. ID No. 53) encodes a homolog of B. 
subtilis dihydropteroate synthase that catalyzes the condensation of pABA (para- 
aminobenzoic acid) with 6-hydroxymethyl-7,8-dihydropterin pyrophosphate, one of 
the early steps in bacterial folate biosynthesis. The polypeptide encoded by this ORF 
30 is set forth in Seq. ID No. 54. (see Example 5/Table 4) 

FolP as a target for anti-infective development 

Folate is an essential cofactor in many important metabolic processes in bacteria, 
such as purine, pyrimidine, amino acid and pantothenate biosynthesis. Unlike 
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mammalian cells, bacteria are unable to utilize exogenous folate derivatives, and 
therefore must synthesize folate de novo. Bacterial folate biosynthesis occurs via two 
converging pathways, the non-essential para-amino-benzoate (pABA) synthesis 
pathway, and synthesis of the pterin precursor, to which pABA is subsequently 

5 attached to form the folate precursor. Enzymes that catalyze steps in the folate 

biosynthesis pathway are essential and well conserved across all bacterial species, 
and those that act in early steps such as FolP (Sul) have no direct homologs in 
mammals. In fact, dihydropteroate synthase (FolP or Sul) is the target for known 
antibiotics sulfonamides which are competitive inhibitors of FolP/Sul as pABA 

10 analogues. Thus FolP (Sul) still remains an attractive target for development of 
broad-spectrum antibacterial agents. 

Suitable assays for measuring FolP/Sul activity 

Sul (DHPS) 6-hydroxymethy-7,8-dihydroneopteroate synthase activity is 

15 monitored individually or in conjunction with upstream enzymes in folic acid 
biosynthesis pathway (FolB and/or FolK). 

DHPS activity is monitored directly by counting the amount of radioactivity 
incorporated in 6-hydroxymethy-7,8-dihydroneopteroate when using radioactively 
labeled p-aminobenzoic acid (pABA). Final product is separated from unreacted 

20 pABA by thinlayer chromatography, paper chromatography or on HPLC equipped 
with radioactivity detector. DHPS substrate (6-hydroxymethyl-7,8-dihydropterin 
pyrophosphate) is not commercially available, but is quantitatively synthesized in one 
step from its oxidized precursor available from Schircks Laboratories (Swizerland). 
DHPS (20 ng) is added in reaction buffer containing 50 mM Tris-HCl, pH 8.0, 20 mM 

25 MgCI 2 , 0.1 mg/ml BSA, 5 mM dithiothreitol and 0.5-10 jiM PABA. Reaction is 

started by addition of stock solution of substrate (6-hydroxymethyl-7, 8-dihydropterin 
pyrophosphate, 0.05-1 \iM final concentration). Reaction is terminated by 
acidification of reaction volume with addition of equal volume of citrate/phosphate or 
ammonium acetate/acetate buffer, pH 4 containing excess of unlabelled pABA. 

30 Quenched reaction is separated by chromatography and the amount of formed 
product calculated. 

DHPS activity is determined in coupled assay with excess of FolB and FolK 
enzymes. The advantage of coupled assay is that it makes it possible to use 
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commercially available FolB (7,8-dihydro-D-neopterin), or FolK (6-hydroxymethyl-7,8- 
dihydropterin) substrates, thus forming DHPS substrate in situ. 

Example 21 

ALLOIOCOCCUS OT1TIDIS ENCODED FILAM ENT ATIQN TEMPERA TURE SENSITIVE GENE A 

" (FtsA) 



The Alloiococcus otitidis ORF-2489 (Seq. ID No. 85) encodes a homolog of E 
10 faecalis FtsA, one of the essential components of bacterial cell division. The "ftsT 
stands for fomentation temperature sensitive and has been assigned to most 
bacterial cell division genes due to the fact that these genes were generally 
discovered by the isolation of conditional mutants that form filaments at 
nonpermissive temperature . The ftsA allele was first isolated and identified in E. coli 
15 by Ricard and Hirota in 1973, and mapped along with ftsZ in 1980.The protein 
encoded by this ORF is set forth in Seq. ID No. 86. (see Example 5/Table 4) 

Bacterial cell division requires formation of a septum at mid-cell that begins 
with the polymerization of FtsZ into a ring structure at the nascent division site. FtsZ, 
another key component of bacterial septation is the first known protein to localize to 
20 the division site. In E. coli, shortly after the formation of the FtsZ ring, FtsA and ZipA 
(another key division component present only in gram-negative bacteria) [7] are 
independently recruited to the septal ring, most likely through their direct interaction 
with FtsZ. Subsequent assembly of other division components at the septum requires 
FtsA as well as FtsZ. 

25 

FtsA as a target for anti-infective development 

Like FtsZ, FtsA homologs are present and highly conserved in almost all 
eubacteria. FtsA is essential for cell division and its deletion leads to impaired ceil 
division and sporulation defect. In addition, E. coli cells have to maintain critical ratio 
30 of FtsA to FtsZ in order for proper cell division to occur. FtsA belongs to the 

actin/DnaK/sugar kinase family of proteins. In B. subtilis, FtsA acting as a dimer riot 
only binds ATP but also hydrolyzes ATP. As briefly stated above, in vivo and in vitro 
evidence have demonstrated that FtsA and FtsZ from various bacterial species 
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directly interact. Taken all together, targeting at FtsA especially at its interaction with 
FtsZ might lead to identification of broad-spectrum antibacterial agents. 

Assays for measuring FtsA activity 

5 

ATPase activity of FtsA is assayed by following the formation of ^Pi from [y-^P]- 
ATP. The reaction mixture containing 50 mM Tris-HCl (pH7.2), 50 mM potassium 
acetate, 1 mM DTT f 10 mM MgCI 2 and different concentrations of [y- 32 P]-ATP is 
incubated for 5 minutes at 37°C. The reaction is started by addition of 50 nM purified 
10 FtsA of Alloiococcus. The reaction is stopped with 1 .5% ammonium molybdate in 
0.5N sulfuric acid, and the radioactive Pi extracted into isoamyl alcohol and counted. 

Interaction between FtsA and FtsZ is detected quantitatively using yeast two- 
hybrid system as described. Briefly, Alloiococcus ftsZ is cloned into yeast two-hybrid 
bait vector pLexA (Clontech) to generate a LexA-FtsZ fusion with DNA-binding 
15 property. Alloiococcus ftsA is cloned into the target vector pB42AD (Clontech) to 
fuse FtsA to the activating domain. Both plasmids are then transformed into a 
Saccharomycyces cerevisiae strain containing a lacZ reporter under the control of 
multiple LexA operators. p-Galactosidase activity is determined to quantify relative 
strength of FtsA-FtsZ interaction. 



20 



EXAMPLE 21 

ALLOIOCOCCUS OTITIDtS ENCODED F1 LAMENTATION TEMPERAT URE SENSITIVE GENE Z 

(FTSZ> 

25 FtsZ is an essential protein that forms a cytokinetic ring (Z-ring) that drives 

cell division in bacteria. FtsZ has been identified in most prokaryotic species with the 
exception of Chlamidia, a Ureaplasma species and Crenarchaea. FtsZ and Z-ring 
formation are most extensively studied in E. coll FtsZ is an abundant cytoplasmic 
protein which is present at ~ 10 4 copies per cell, and is the first protein to be localized 

30 to the division site. Z-ring is required throughout septation and directs the ingrowth of 
septum in part by recruiting other cell division protein to the division site. Another 
function is suggested by FtsZ homology to eukaryotic tubulins. Like tubulin, FtsZ is a 
GTPase and undergoes GTP/GDP-dependent polymerization. Recent studies 
showed that Z-ring is a very dynamic structure suggesting that GTP-dependent 
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assembly/disassembly of Z-ring might provide constriction force to power cell 
division. Homologue of this gene identified in AHoiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 83). The protein encoded by the gene is set forth in 
Seq. ID No. 84. 

5 

GTPase activity 

FtsZ is a GTPase that contains the tubulin-signature nucleotide-binding motif 
GGGTGS/TG. Like in □□□-tubulin dimer, the active site for GTP-hydrolysis appears 
to be shared between two subunits where the GTP-binding pocket is provided by one 
10 subunit while the GTPase-activating T7 loop comes from the other subunit This view 
is supported by genetic analysis as various mutations that inhibit FtsZ GTPase 
activity map in the T7-loop region and a conserved Asp-residue in T7-loop is found to 
be involved in the coordination of the cation involved in GTP hydrolysis. FtsZ 
GTPase activity is Mg 2+ -dependent and is stimulated by KCI. 

15 

Polymerization 

In vivo, about 75% of FtsZ is present as multimers. In vitro, FtsZ forms a 
variety of structures at various conditions. FtsZ assembles into thin protofilaments 
with GTP and formation of FtsZ polymers is coupled to GTP hydrolysis: when GTP 
20 runs out, polymers disassemble. Protofilaments assemble into sheets and bundles in 
the presence of multimolar amounts of either Mg 2+ or Ca 2+ or by addition of DEAE- 
dextran. In addition, ZipA protein induces bundling of FtsZ polymers. With GDP, FtsZ 
assembles into curved filaments and minirings. 

25 Interactions with other proteins 

In E. coli, at least nine different proteins are localized to the division septum and 
are required for cell division to proceed. Among them two proteins, ZipA and FtsA, 
are shown to interact directly with FtsZ. Both of these proteins localize to the division 
site independently from each other, but require FtsZ for localization. ZipA is an 
30 integral membrane protein which is thought to mediate invagination of cell membrane 
by linking the membrane to constricting Z-ring. Interaction between ZipA and FtsZ is 
confined to C-terminal portion of ZipA (residues 185-328) and conserved 17-amino 
acid region on C-terminus of FtsZ. FtsA is an actin-like membrane-associated protein 
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which possesses ATPase activity and might provide energy required for Z-ring 
dynamics. Interaction between FtsZ and FtsA is not studied in great detail, it is shown 
that C-terminus of FtsZ is required. The remaining division proteins require both ZipA 
and FtsA for their localization to Z-ring. 

5 

FtsZ as a target for anti-infective development 

FtsZ is an essential protein for cell division/bacterial viability. Knock-out ftsZ 
mutants fail to divide and, as a result, filament and die. The target is widely 

10 conserved throughout bacterial kingdom implying that FtsZ-specific inhibitor would 
have a broad-spectrum antibacterial activity. The potential drawbacks of the target 
might include the presence and the essential role of a homolog (tubulin) in 
eukaryotes and an intrinsic difficulty in inhibiting protein-protein interactions by small 
molecules. Although this target is being studied extensively, no FtsZ-specific 

15 compounds are reported up to date. 

Assays for measuring FtsZ function 

Polymerization of FtsZ is measured by light scattering assay as described 
previously. FtsZ (12.5 pM) is incubated in 200 pi of polymerization buffer (50 mM 

20 MES/NaOH, pH 6.5, 50 mM KCI, 5 mM MgCI 2 , 10 mM CaCI 2 ) in a fluorescence 

cuvette with a 1 cm path length. The sample is maintained at 30°C, polymerization is 
induced by addition of 20-500 pM GTP. Light scattering is measured at 90°, both 
excitation and emission wavelengths are set to 350 nm, slit width is 2 nm. 
Alternatively, the amount of polymerized FtsZ is analyzed by sedimentation and 

25 subsequent quantification of precipitated FtsZ by SDS-PAGE, Coomassie staining 
and densitometric scanning. In addition, polymers are observed by electron 
microscopy. This assay represents either primary or secondary/confirmatory assay. 

GTP binding of FtsZ is monitored by the covalent cross-linking of [y- 32 P]GTP 
(3000 Ci/mmol) to FtsZ in a previously described competition assay. FtsZ (3 pg) is 

30 incubated in 20 pi of 50 mM MES/NaOH, pH 6.5, 100 mM KCI, 4 mM MgCI 2 , 1 mM 
EDTA, 0.1 mM EGTA and 0.5 mM DTT. Various amounts of non-labeled competing 
nucleotide (GTP or GTP analogs) and 0.1 mM [y-^PJGTP are added, samples are 
incubated at 0°C for 1 5 min, then UV cross-linked for 5 min and analyzed by SDS- 
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PAGE on 12% gel, autoradiography and densitometric scanning. This assay 
represents a secondary/confirmatory assay. 

The GTP hydrolytic activity of FtsZ is monitored by thin-layer chromatography 
(TLC) as described previously. Briefly, the reaction mixture consists of 5 mM of [y- 

5 ^PJGTP (40 mCi/mmol), 1 5 mM magnesium acetate and 0.25-2 mg/ml of FtsZ in 
reaction buffer (40 mM Tris-acetate, pH7, 200 mM potassium acetate, 2 mM EDTA, 1 
mM DTT and 0.5% Triton X-100), aliquots are separated by TLC and amount of GTP 
converted to GDP is determined by spot-densitometry. Alternatively, GTPase activity 
is measured either by quantitation of the non-radioactive inorganic phosphate with 

10 the malachite green-molybdate reagent as described previously or by quantitation by 
scintillation counting of radioactive inorganic phosphate released after hydrolysis of 
[y-^PJGTP (26). This assay represents either primary or secondary/confirmatory 
assay. 

Among interactions of FtsZ with various cell division proteins, interaction 
15 between FtsZ and ZipA is characterized the best. ZipA -induced bundling of FtsZ is 
measured by the light scattering assay that is described above, both proteins are 
used at ;>5 uM. 



Example 22 

20 ALLOIOCOCCUS OTITIP1S ENCODED GYRA/GYRB ( DNA GVRASE. TOPOISOMERASE IP 

AND GRLA/GRLB (TOPOISOMERASE IV) 

DNA topoisomerases: topoisomerases modulate the topological state of DNA 
in cells. This involves binding to DNA, introducing single or double stranded breaks 

25 in the DNA, passing DNA molecules through the break and rejoining the break. This 
controls the levels of positive and negative supercoiling of DNA and functions in 
catenation/decatenation. Controlling the topological state of DNA is critical to the 
fundamental processes of transcription, recombination, replication and partitioning of 
the chromosome. There are two main categories of topoisomerases, type I and type 

30 II. Type I topoisomerases introduce single stranded breaks in DNA whereas type II 
enzymes introduce double stranded breaks. GyrA/GyrB (gyrase) and GrIA/GrIB 
(topoisomerase IV) are both type II enzymes that are essential for cell viability. 

DNA gyrase (GyrA/GyrB) is a type II topoisomerase that functions to control 
the degree of supercoiling in double stranded DNA. It is essential for viability and 
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plays central roles in replication, repair, recombination and transcription of DNA. 
Gyrases have the ability to introduce double stranded breaks in DNA molecules while 
remaining bound to the DNA through phosphotyrosine bonds, pass uncut DNA 
through the break and then rejoin the breaks, with repeated cycles being driven by 
5 the hydrolysis of ATP. Gyrase has the unique ability to introduce negative supercoils 
in closed circular DNA and also functions to catenate/decatenate DNA duplexes. 
The generation of negative supercoiling is important for initial stages in replication. 
DNA gyrase from Escherichia coli has been studied in detail. It is a complex of two 
subunits of GyrA (encoded by gyrA) and two subunits of GyrB (encoded by gyrB) (ie. 
10 A 2 B 2 complex). The subunits are organized in discreet domains. An N-terminal 
domain of GyrB harbors ATPase activity while the C-terminal domain is thought to 
interact with the GyrA subunit, and is involved in DNA binding. The N-terminal 
domain of GyrA is apparently involved in DNA strand breakage-ligation reactions 
while the C-terminal segment is involved in DNA binding. Crystal structures of the 
15 DNA strand breakage/reunion domain of E. coli GyrA, and the N-terminal ATPase 
domain of E. coli GyrB have been determined. DNA gyrase has also been purified 
and characterized from gram positive organisms such as S. aureus. Comparison of 
DNA gyrases from several bacteria reveal a high degree of conservation of important 
domains. 

20 Topoisomerase IV (GrIA/GrIB) is a type I! topoisomerase but unlike gyrase it 

does not possess negative supercoiling activity. Its primary role in replication 
appears to be in the decatenation of multiply linked daughter chromosomes, 
important for terminal stages of the replication process. Topoisomerase IV has been 
purified and characterized from gram negatives eg. E. coli, (where the GrIA/GrIB 

25 subunit homologs are designated ParC and ParE), and gram positives eg S. aureus. 
Homologs of thse gene identified in Alloiococcus otitidis is described in Example 
5/Table 4 (Seq. ID Nos 1 7 and 19). The proteins encoded by the genes are set forth 
in Seq. ID Nos. 1 8 and 20. 
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GyrA/GyrB (Gyrase) and GrlA/GrlB (topoisomerase IV) as targets for anti- 
infective development: 

Alloiococcus otitidis is an infectious organism associated with disease, and 

5 consequently, novel antimicrobials to combat these infections are desirable. DNA 
gyrase and Topoisomerase IV is essential for bacterial viability and is a well- 
established and validated antibacterial target. 

Purification of DNA gyrase and topoisomerase IV from Alloiococcus 

Genes encoding the GyrA/GyrB and GrlA/GrlB subunits or their functional 
domains are obtained using polymerase chain reaction amplification of the genomic 
region encoding them. The genes are then subcloned into standard expression 
vectors, with or without affinity tags. The enzyme is then overexpressed in 
15 Escherichia coli and purified using a standard tag system or conventional 
chromatography. 

Measurement of gyrase and topoisomerase IV by kinetoplast DNA 

decatenation assay: , , .... . 

20 Type II topoisomerases introduce double stranded breaks in DNA and 

mediate catenation/decatenation of DNA. Topoisomerase IV activity is readily 

determined with decatenation assays using as substrate kinetoplast DNA (KDNA) 

from Crithidia fasciculata. The DNA isolated in this procedure is a highly networked 

series of catenated double stranded minicircles and is easily be pelleted by 

25 centrrfugation. The activity of topoisomerase II enzymes results in the release of 

decatenated DNA minicircles from the networked KDNA. These have a high mobility 
in agarose gels and migrate into the gel ahead of the networked material, which has 
very low mobility, allowing for determination of decatenation activity using ethidium 
bromide stained agarose gel electrophoresis. 

30 Alternatively, using radiolabeled KDNA, the level of decatenation activity is 

measured by counting radioactivity remaining in reaction supernatants following 
centrifugation to pellet the networked material. Typical conditions used for assaying 
decatenation activity of S. aureus and E. coli topoisomerase IV activity are as follows: 
C. fasciculata KDNA (0.9 mg/ml) is incubated in 40 pi of reaction buffer (50 mM Tris- 

35 HCi, pH 7.7, 5 mM MgCI 2 , 5 mM DTT, 50 pg/ml bovine serum albumin, 1 .5 mM ATP 
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and 350 mM potassium glutamate) with appropriate amounts of the GrI subunits, for 
1 hour at 37° C. if non radiolabeled KDNA is used, these reactions can be stopped 
and analyzed by agarose gel electrophoresis, or for radioassays, the reaction is 
stopped by gentle mixing with 10 pi of stop solution (50 % glycerol, 50 mM EDTA (pH 

5 8.0), 2.5 % SDS and 0.1 % bromphenyl blue) and centrifuged at 1 5 000 x g for 5 min 
at 20° C. Decatenation activity is determined by counting radioactivity in 25 p! of the 
supernatant in a scintillation counter. Alternatively, a modified assay employing flow 
injection fluorometry of 4\ 6-diaminidino-2-phenylindole (DAPI) treated supernatants 
has been described that could be suitable for moderate throughput non radioactive 

10 assays, or filtration of the reactions through appropriate filters may efficiently 
separate the decatenated species from KDNA. Although the above described 
assays were used for topoisomerase IV, modified decatenation reactions using 
KDNA isolated from Leishmania donovani reveal significant decatenation activity by 
gyrase from E. coli and Mycobacterium smegmatis, indicating the applicability of the 

15 assay to prokaryotic gy rases. 

DNA Supercoiling/relaxation assays. 

DNA gyrase function is directly assayed using a simple supercoiling assay 
typified by that described for the measurement of Escherichia coli DNA gyrase 

20 activity. Briefly, incubation of relaxed closed circular plasmid DNA (pUC18, 7.5 nM) 
in the presence of DNA gyrase (approximately 10 nM) in 40 mM Tris-HCI (pH 8.0) 
buffer containing 25 mM KCI, 4 mM MgCI2, 2.5 mM spermidine and 1 .4 mM ATP 
buffer results in the introduction of supercoils in the plasmid DNA. Changes in DNA 
supercoiling status are readily observed by the alteration of mobility of the DNA in 

25 agarose gels stained with ethidium bromide and comparison to the mobility of relaxed 
and supercoiled plasmid template. This strategy is employed for screening for DNA 
gyrase inhibitors. 

Topoisomerase IV activity is assayed by measuring relaxation of supercoiled 
plasmid DNA. A typical relaxation assay used for S. aureus topoisomerase IV 
30 activity is as follows: topoisomerase IV enzyme and supercoiled plasmid DNA 
(pBR322, 0.6 pg) is incubated in 40 pi 50 mM Tris-HCI, pH 7.7, containing 5 mM 
MgCI* 5 mM DTT, 50 pg/ml bovine serum albumin, 1.5 mM ATP, 5 mM spermidine 
and 20 mM KCI, for 30 min at 37°C. Changes in DNA supercoiling status can be 
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readily observed by the alteration of mobility of the DNA in agarose gels stained with 
ethidium bromide and comparison to the mobility of relaxed and supercoiled plasmid 
template 

The ATPase activity of topoisomerases is measured using a coupled 
5 spectrophotometric ATPase assay described for the GyrB subunit of E. coli. ATPase 
activity is assayed in 300 pi of 40 mM Tris-HCI (pH 8.0), containing 25 mM KCI, 2.5 
mM spermidine, 4 mM MgCI2, 400 pM phosphoenolpyruvate, 250 pM NADH, 3 pl of 
pyruvate kinase /lactate dehydrogenase mix and ATP (0.5 - 3.5 mM). The reaction 
is started by the addition of truncated N-terminal derivatives of the GyrB protein (5 
10 pM) containing the ATPase domain. ATPase activity is reflected as a decrease in 
absorbance of light at 340 nanometer wavelength. 

DNA cleavage assay. 

Quinolone drugs interfere with the DNA strand breakage-ligation cycle activity 

15 of many topoisomerases. Incubation of topoisomerase and linear or supercoiled 

pBR322 plasmid DNA, or small linear DNA fragments, in the presence of quinolones 
and magnesium results in the trapping of a complex of topoisomerase, DNA with a 
double stranded break and the drug. The topoisomerase remains bound to the 
cleaved DNA, however treatment with a denaturant such as SDS or proteinases 

20 remove/degrade the gyrase, releasing the cut DNA. Certain consensus sequences 
representing preferred cut sites of E. coli gyrase in plasmid pBR322 have been 
identified in template DNA molecules used in these assays. This assay is useful for 
mode of action studies of inhibitors of gyrase/topoisomerase IV activity and in 
particular of the strand breakage-ligation function. Cleavage reactions are performed 

25 with linear or supercoiled DNA. A typical cleavage reaction using linear DNA to 
measure cleavage by E. coli and S. aureus gyrase and topoisomerase IV in the 
presence of drugs is as follows: gyrase/ topoisomerase IV is incubated in 20 pl 25 
mM Tris-HCI (pH 7.5) containing 0.5 mM EDTA, 0.5 mM DTT, 3 pg bovine serum 
albumin per ml, 10 mM MgCI 2 , 120mM KCL 10 mM ATP, 10 000 dpm of 3' end 

30 labeled linear pBR322 plasmid DNA and drug for 1 hour at 37°C. (Note: for S. 

aureus, KCI is replaced with 0.7 M potassium glutamate). Reactions are terminated 
by adding 5 pl 2.5% SDS-2.5 mg proteinase K per ml and incubating at 37°C for 30 
minute, then adding 5 pl 30% glycerol-1% SDS-50 mM EDTA-0.05 % bromophenol 
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blue. Cleavage products are resolved on 1% agarose gels and visualized by 
autoradiography. 

Additional cleavage assays are also used that measure 1 ) the linearization of 
supercoiled plasmid DNA (pBR322), with linearization measured using scanning 

5 densitometry of DNA species separated on 1 % agarose gels, or 2) the cleavage of 
small linear DNA molecules of approximately 100 bp encompassing the preferred 
cleavage sequence 5'- GGCTGGATGGCCTTCCCCAT - 3' from position 990 in 
plasmid pBR322. In the latter case, the fragment is produced by PCR and 
radiolabeled with y- 32 P ATP at the 5' end of the top strand. This DNA is incubated 

10 with 1 .3 pmoi DNA gyrase in a total volume of 10 pi 35 mM Tris-HCI (pH 8.0), 24 mM 
KCI, 2 mM spermidine, 4 mM MgCI2 and inhibitor compound at 37°C for 10 min. 
Reactions are stopped by addition of 8 mM EDTA and 1% SDS, then treated with 
500 pg/ml proteinase K for 2 hours at 37°C. The DNA is then cleaned by phenol- 
chloroform extraction and ethanol precipitation, resuspended in TE buffer (pH 8.0), 

15 and loaded and resolved on 12 % sequencing gels containing 7M urea. In the 

presence of inhibitors of the strand breakage-ligation function, radioactive cleavage 
products are detectable by autoradiography. Modifications of this assay whereby 
one strand of the DNA substrate is labeled with an affinity tag such as biotin and the 
other is radiolabeled or fluorescently labeled should facilitate rapid separation and 

20 detection of cleavage products using streptavidin coated columns or plates, resulting 
in higher assay throughput. 

Gyrase activity assays: DNA replication: 

Early work by Fuller and Kornberg revealed that a partially purified crude 

25 soluble fraction derived from Escherichia coli cells (designated fraction II) contained 
the components necessary for replication of plasmids containing oriC (E. coii 
chromosomal origin of replication). Replication mediated by this fraction specifically 
required supercoiled plasmids. Although the exact makeup of the protein complex 
mediating the replication was not known, the replication reaction was inhibited by 1 ) 

30 rifampicin, and 2) nalidixic acid and novobiocin, indicating essential roles for both 
RNA polymerase and DNA gyrase, respectively. Subsequently the reaction was 
reproduced using replication machinery reconstituted from purified protein HU, DnaA, 
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DnaC, DnaB, single stranded binding protein (SSB), primase, DNA polymerase 
holoenzyme, RNA polymerase hoioenzyme and GyrA/GyrB. 

The requirement for gyrase activity for replication is exploited for the 
identification of gyrase inhibitors using a replication-based high throughput screen. 

5 Gyrase specific inhibitors are identified from the overall pool of replication inhibitors 
using the secondary assays detailed below. Screening for inhibitors of gyrase in a 
setting where gyrase is participating in an overall reaction that is essential in bacteria 
might better select physiologically relevant inhibitors 

An assay suitable for high throughput screening of inhibitors of replication 

10 (including gyrase and DnaA inhibitors) is based on the replication reaction of Kaguna 
and Kornberg. This reaction was set up as follows; standard reaction in 25 pi: 40 mM 
Hepes (pH 7.6), 2 mM ATP, 0.5 mM GTP, CTP and UTP, 50 pg/ml bovine serum 
albumin, 6 mM phospho creatine, 100pM dATP, dGTP, dCTP and dTTP, y-^P dTTP 
(50-150 cpm/pmol total nucleotides) 11 mM magnesium acetate,100 pg/mL creatine 

15 kinase,85 ng SSB, 48 ng DnaB, 40 ng DnaC, 20 ng primase, 160 ng DNA 

polymerase III holoenzyme, 800 ng RNA polymerase, 150 ng GyrA, 350 ng GyrB, 
120 ng DnaA, 2.5 units topoisomerase 1, 190 ng HU, 0.15 ng Rnase H 200 ng 
supercoiled plasmid template. The reaction is assembled at 0 °C and initiated by 
incubation at 30°C. Replication reactions are terminated by the addition of EDTA to 

20 20 mM. Incorporation of nucleotides into DNA is measured by filtration through 96 
well DEAE filter plates and counting retained radioactivity. 

Compounds inhibiting gyrase activity in Alloiococcus otitidis are found as part 
of a larger program directed at replication. This reaction described above uses the 
replication machinery of a gram-negative organism, which differs somewhat from the 

25 replication machinery of gram positives such as Staphylococcus aureus with respect 
to the specific protein subunits involved. Therefore a similar system specific to 
Alloiococcus otitidis is assembled from the relevant proteins purified from 
Alloiococcus otitidis. Several techniques are then utilized to determine the interaction 
of inhibitors with Gyr A and GyrB. These are described in the literature and include 

30 a) Nuclear magnetic resonance; and b) Capillary electrophoresis.. 
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Example 23 

ALLOIOCOCCUS OT1TIDIS ENCODED CELL WALL BIOSYNTHETIC ENZYMES MURA 

Bacteria! cell wall peptidoglycan (murein) is a large macromolecule of periodic 

5 structure whose basic unit, a disaccharide-peptapeptide, is polymerized linearly via 
the disaccharide motif and cross-linked laterally via the peptide motif. The process of 
bacteria cell wall biosynthesis starts from the transferase MurA, which transfers the 
addition of an enolpyruvyl moiety to the 3'-hydroxyl-UDP-N-acetyl glycosamine 
(UDP-GiuNAc). Subsequently, the reductase MurB reduces the enol ether to the 

10 lactyl ether, utilize one equiv. of NADPH and a solvent proton to form UDP-A/-acetyl 
muramic acid (UDP-MurNAc). Next a series of ATP dependent amino acid ligases 
(MurC, MurD, MurE and MurF) catalyze the stepwise synthesis of the pentapeptide 
side chain using the newly synthesized carboxylate as the first acceptor site. Each 
enzyme is responsible for the addition of one more residue except MurF, catalyzes 

15 D-ala-D-ala. MurE in gram negative bacteria catalyzes the meso-2, 6- 

diaminopimelate (DAP), while in gram positive bacteria MurE catalyzes L-lysine. 

The product of MurF, UDP-NAM pendapeptide is the final product of the 
cytoplasm enzymes and is the most important precusor for further peptidoglycan 
biosynthesis. UDP-MurNAc pendapeptide is then and catalyzed at the plasma 

20 membrane by the membrane bound enzymes such as the translocase MraY and 
transferase MurG. 

UDP-/V-acetylglucosamine enolpyruvyl transferase (MurA) catalyzes the first 
committed step in bacterial cell wall biosynthesis. The enzyme transfers ah 
enolpyruvyl group from phosphoenolpyruvate (PEP) to UDP-N-acetylglucosamine 

25 (UDP-GluNAc) to the 3-OH of UDP-GlcNAc by an addition-elimination mechanism 
that proceeds through a tetrahedral ketal intermediate. MurA product enolpyruvate 
UDP-A/-acetylglucosamine (EP-UNAG) is a precursor to UDP- N-acetylmuramate 
(UDP-MurNAc), an essential building block for the bacterial cell wall. MurA is 
conserved across both gram-positive and gram-negative bacterial species: gram- 

30 negative bacteria have one copy of the murA and gram-positive bacteria have two 
copies. Alfoiococcus otitidis murA was identified as described in Example 5/Table 4 
and its genomic structure set forth in Seq. ID No. 101- The amino acid sequence of 
the protein encoded by this gene is set out in Seq. Id No. 102. 
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Alloiococcus otitidis murA as a target for anti-infective development 

MurA in E. coli and Streptococcus pneumoniae has been shown to be 
essential by gene deletion technique. The essentiality of MurA in gram-positive 
bacteria such as Streptococcus pneumoniae was demonstrated in that its deletion is 
5 fetal. No mammalian homolog to MurA has been reported. MurA is specifically 

inhibited by the natural product antibiotic fosfomycin. Thus the importance of MurA in 
peptidoglycan biosynthesis makes it an attractive target for the design of novel 
antibacterial agent. 

10 Assays for measuring MurA function 

Phosphate detection: 

MurA activity is detected by quantitating the UDP-GluNAc-dependent Pi from 
PEP and assayed by Lanzetta's malachite Green-ammonium molybdate assay. Pi is 
15 quantitated by measuring the optical density at A660 nm. 

Coupled assay with MurB: 

A coupled assay in access of MurB, which reduces the MurA product EP- 
UNAG G to UDP-MurNAc, couples the MurA transferase activity with NADPH 
20 oxidation. The oxidation of NADPH is monitored at 340 nm and is stoichometric with 
the production of EP-UNAG. 

Fluorescence experiments 

Fluorescence experiments to detect murA are performed using the 
25 hydrophobic fluorescence probe 8-anilino-1 -naphthalene sulfonate (ANS). The 
fluorescence quenching of MurA/ANS solutions upon addition of UDP-GlcNAc or 
pyruvate-P is concentration dependent and in a saturating manner. 

Isothermal titration calorimetry 

30 The binding of UDP-GluNAc to MurA is studied in the absence and presence 

of the antibiotic fosfomycin by isothermal titration calorimetry. Fosfomycin binds 
covalently to MurA in the presence of UDP-GluNAc and also in its absence as 
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demonstrated by MALDI mass spectrometry. Novel Fosfomycin analogs and other 
antibiotics that bind to murA are also identifiable using isothermal titration chemistry. 

Capillary electrophoresis-based enzyme assay 

5 A capillary electrophoresis-based enzyme assay for MurA is described by Dai 

and colleagues . This method, based on UV detection, provides baseline separation 
of one of the reaction products, EP-UNAG, from substrates PEP and UDP-GlcNAc 
within 4 min. The other product, phosphate, is not detectable by UV at 200 nm. 
Quantitation of individual components, substrates or product, is be accomplished 

10 based on the separated peaks. This assay is also used to detect novel antibiotics, 
which inhibit murA activity. 

Example 23 

ALLOIOCOCCUS OTmPIS ENCODED CELL WALL BIOSY NTH ETIC ENZYMES MURB 

15 

MurB, the UDP-/V-acetyI enolpyruvyl glucosamine reductase, commits the second 
step of bacterial cell wall biosynthesis in cytoplasm and is responsible for the reduction of 
the enol ether to the lactyl ether, utilizes one equiv. of NADPH and a solvent proton. The 
product of MurB is UDP-N-acetylmuramic acid (UDP-MurNAc), the linker of the peptide 

20 and glycan portions of cell wall precursor UDP muramyl-pentapeptide. MurB from E. coli 
is a 342 amino acid protein, which has a distinctive yellow color characteristic of bound 
flavin as its co-factor. The biochemistry characterization and X-ray crystal structure of 
MurB in E. coli, in Staphylococcus aureus and Streptococcus pneumoniae have been 
studied extensively. The gene Alloiococcus oitidis murB was identified as disclosed as 

25 described in Example 5, and is set out in Seq. ID No. 39. The amino acid sequence of 
the protein encoded by this gene is set out in Seq. ID No. 40. 

Alloiococcus oitidis murB as a target for anti-infective development 

30 The essentiality and unique function of MurB in prokaryotic cells and the 

absence of homologue in eukaryotic cells make it an attractive novel antibacterial 
target. To date, no small molecule inhibitors of MurB have been reported. 
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Alloiococcis oititidis ORF-1263 {murB ) (Seq. ID No. 39) encodes enzyme 
UDP-N-acetylenolpyruvylglucosamine Reductase (MurB) as shown by sequence 
homology. 

5 Assays for measuring MurB activity 

Spectrophotometry assay monitoring NADPH consumption: 

MurB activity is typically monitored by its biochemical reaction in which 
NADPH reduces the bound FAD and resulting decrease in absorbance at 340 nm. 
Enzyme is maximally activated in the presence of K+, NH 4 at cation concentrations 
10 between 10-50 mM. 



Coupled assay with MurC: 

In designing an end point assay for high through put screen (HTS), a novel 
coupled assay in access of UDP-MurNAc L-alanine synthase (MurC) was developed 

15 at Wyeth. This assay utilizes the biochemically synthesized MurA product EP-UNAG 
as substrate, coupled with limited MurB and excess MurC in the reaction with all 
other substrates/components involved. In this assay, MurB is responsible for the 
reduction of the enol ether to the lactyl ether, and the follow up enzyme MurC 
catalyzes the ATP dependent ligation of the first of the five amino acids of UDP- 

20 peptapeptide with a release of one molecule of phosphate. After 60 minutes of 
incubation, color reagent malachite green was added and phosphate was detected 
spectrophotometrically. 

Fluorescence binding assay 

25 A fluorescence method developed at Wyeth is used to determine the binding 

potency (Kd value), stoichiometry and nature of binding site of substrates and 
inhibitors interactions with MurB enzymes. This assay is based on changes in 
intrinsic fluorescence of inhibitor and/or enzyme, upon formation of enzyme-inhibitor 
complex. Oxidized form of MurB consists of two fluorescent groups, namely 

30 tryptophan residues and the cofactor FAD. Upon binding inhibitor or substrate, local 
changes in the solvent environment of these groups or overall conformational and 
electronic changes occur in the enzyme due to which the fluorescence emission is 
altered. For instance, inhibitor binding significantly quenched the fluorescence and 
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altered the solvent environment of FAD to a less polar environment. The changes in 
the fluorescence of the FAD moiety are used to estimate binding constants for MurB 
inhibitors. Binding experiments are set up in which a fixed concentration of enzyme is 
titrated with increasing concentrations of the inhibitor. In typical inhibitor binding 
5 experiments, the fluorescence emission of the FAD moiety is quenched due to 
specific interactions of the inhibitor with MurB enzymes and the binding site was 
saturated at micromolar concentrations of inhibitor. The changes in the fluorescence 
are fitted to mathematical binding models to determine binding affinity. 

10 Temperature-jump isothermal denaturation procedure 

Temperature-jump isothermal denaturation procedure with various methods 
of detection is used to evaluate the quality of putative inhibitors of MurB discovered 
by high-throughput screening. Three optical methods of detection-ultraviolet 
hyperchromicity of absorbance, fluorescence of bound dyes, and circular dichroism- 
15 as well as differential scanning caiorimetry are used to dissect the effects of two 
chemical compounds and a natural substrate on the enzyme. The kinetics of the 
denaturation process and binding of the compounds detected by quenching of flavin 
fluorescence are used to quantitate the dose dependencies of the ligand effects. 

20 NMR studies 

NMR studies are performed using perdeuterated, uniformly 13C/15N-labeled 
samples of MurB. In the case of substrate-free MurB, one or more backbone atoms 
are assigned for 334 residues (96%). For NADP+-comp1exed MurB, one or more 
backbone atoms are assigned for 313 residues. The strategies used for obtaining 
25 resonance assignments are known. Localizing the NADP+ binding site on the MurB 
enzyme is also studied by NMR methodology. 

Example 25 

ALLOrOCOCCUS OT777P/S ENCODED CELL WALL B1QSYNTHET1C ENZYME, MURC 

30 

Uridine diphosphate-N-acetylmuramate:L-alanine ligase (MurC) catalyzes the 
third chemical step of bacterial cell wall biosynthesis. This enzyme is a nonribosomal 
peptide ligase which utilize ATP to form an amide bond between L-alanine and UDP- 
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10 



N-acetylmuramic acid (UDP-MurNAc). This ATP-dependent ligation adds the first of 
five amino acids to the sugar moiety of the peptidoglycan precursor. Also, in this 
reaction, ATP is converted to ADP with release of one molecule of inorganic 
phosphate. Thus MurC reaction is an essential step in cell wall biosynthesis for both 
gram-positive and gram-negative bacteria. The genetic, biochemistry analysis and 
crystal graphic studies of MurC in gram-negative bacteria E. coli have been 
extensively studied. Characterizations of MurC in other pathogens such as 
Staphylococcus aureus and Pseudomonas aeruginosa have also been documented. 

Alloiococcis otitidis encoded MurC as a target for anti-infective development 



The Alloiococcis otitidis ORF-2602 (murC, Seq. ID No. 95) encodes enzyme 
UDP-MurNAc:L-alanine ligase (Mu/C) as determined by sequence homology. This 
enzyme presents a target for the development of novel anti-infectives to treat the 
15 disease(s) caused by this pathogen. Novel compounds identified using combinatorial 
chemistries are assayed for their inhibitory effect on MurC activity using one of the 
asssays set out below. 

Assays for measuring MurC activity 

20 Spectrophotometric assay detecting phosphate release: 

MurC activity is detected by the inorganic phosphate production. Typically 
the reaction mixture contains substrates ATP, L-alanine, UDP-MurNAc, DTT, MgCI 2 
and MurC enzyme. After 20 minutes incubation, the reaction is quenched with the 
addition of malachite Green-ammonium molybdafe for a colored reaction. 

25 Absorbance at 660 nm is read 5 minutes after the quench. Absorbance values are 
converted to concentration of Pi with standard curves using KH 2 P0 4 , which is 
prepared under identical conditions without the enzyme MurC. 

Spectrophotometric assay detecting formation of ADP 

30 Due to the conversion of ATP to ADP in MurC reaction, the production of 

ADP is monitored in coupled enzymes spectrophotometricaily. In this reaction, in 
addition to MurC substrate UDP-MurNAc, L-alanine and ATP, NADH, 
phosphoenolpyruvate, MgCI 2 and (NH 4 ) 2 S04, two other coupled enzymes pyruvate 
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kinase and lactase dehydrogenase are also presented. Reaction mixtures without 
ATP and MurC are incubated at 37°C for 10 min before ATP is added for another 
minute. Reaction is then started by the addition of MurC. The decrease of NADH 
absorbance at 340 nm is monitored spectrophotometrically. One unit of activity 
5 corresponds to 1 umol of ADP formed per hour. 

L-Alanine radio-labeled assay: 

The MurC enzyme activity in this assay is measured as endpoint using 14 C-L- 
alanine and ATP incubated with MgCI 2 , and (NH 4 )2S0 4 in 100 mM Tris/HCI, pH 8.0. 
10 Reaction is initiated by the addition of the catalytic amounts of MurC. Samples of the 
reaction mixture are then mixed with glacial acetic acid and then stored at 4°C. 
Remaining 14 C -L-alanine is separated from 14 C -UDPMurNAc on SCX columns run 
under vacuum. Quenched reaction samples are supplemented with equilibration 
buffer and counted using a liquid scintillation counter. 

15 

Example 26 

ALLO/OCOCCUS OTITIDIS ENCODED CELL WALL BIOSYNTHETIC ENZYMES MURD 

Bacterial UDP-N-acetylmuramyl-L-alanine:D-glutamate ligase (MurD), a 
20 cytoplasmic peptidoglycan biosynthetic enzyme, catalyzes the fourth step of bacterial 
cell wall biosynthesis. In this reaction, MurD catalyzes ATP-dependent addition of D- 
glutamate to an alanyl residue of the UDP-N-acetylmuramyl-L-alanine (UDP- 
MurNAc-L-Ala) precursor, generating the UDP-MurNAc-dipeptide. The formation of a 
peptide linkage between the amino function of D-glutamate and the carboxy 
25 terminius of UDP-N-acetylmuramuamyl-L-alanine is generated through this reaction. 
The stoichiometric consumption of ATP supplies the energy needed for this peptide 
bond formation with concomitant generation of ADP and orthophosphate. The murD 
genes were cloned and characterized from gram-positive bacteria of Staphylococcus 
aureus and Streptococcus pyogenes, and gram-negative bacteria from Escherichia 
30 co// t Haemophilus influenzae, Bacillus subtilis. Structures of MurD from E coli and 
MurD complexed with its substrate UDP-MurNAc-L-Ala have been solved to 2.0 A 
resolution. The role of specific amino acids at the active site of MurD have been 
extensively studied using the ortholog and paralog amino acid invariants. Homologue 
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of this gene identified in Alloiococcus otitidis is described in Example 5/T able 4 (Seq. 
ID No 89). The protein encoded by the gene is set forth in Seq. ID No. 90. 

Alloiococcus otitidis encoded MurD as a target for anti-infective development 

5 

Due to its high specificity and essentiality, MurD is an attractive target for the 
development of novel antimicrobial agents. Alloiococcis otitidis ORF-2494, by 
sequence homology, has been shown to encode enzyme UDP-N-acetylmuramy!-L- 
alaninerD-glutamate ligase (MurD) (Seq. ID. No. 89). Inhibition of MurD activity is 
10 used to identify novel antimicrobial agents. 

Assays for measuring MurD activity 

Spectrophotometric assay detecting phosphate release: 

15 MurD activity in the presence or absence of a putative inhibitory molecule of 

MurD is detected by the orthophosphate production in test tube or in 96-well format. 
Typically the reaction mixture contains substrates ATP, D-glutamine, UDP-MurNAc- 
L-Ala, DTT, MgCI2 and MurD enzyme. After 20 minutes incubation, the reaction is 
quenched with the addition of malachite Green-ammonium molybdate for a colored 

20 reaction. Absorbance at 660 nm is read 5 minutes after the quench using Molecular 
Devices SpectraMax 250 plate reader. Absorbance values are converted to 
concentration of Pi using orthophosphate standards, which are prepared under 
identical conditions without the enzyme MurD. 

25 

Spectrophotometric assay for detecting formation of ADP in the presence or 
absence of a putative inhibitory moliecule of MurD: 

Due to the conversion of ATP to ADP in MurD reaction, the production of 
ADP is monitored with coupled enzymes of pyruvate kinase and lactase 
30 dehydrogenase spectrophotometrically. In this reaction, in addition to MurD 
substrate UDP-MurNAc-L-ala and ATP, MgCI 2 and (NH 4 )2S0 4 , there is also in 
significant access of NADH, phosphoenolpyruvate, and two coupled enzymes 
pyruvate kinase and lactase dehydrogenase. This protocol monitors ADP formation 
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in the MurD catalyzed reaction, in the presence or absence of a putative inhibitory 
mollecule of MurD, by the decrease of NADH absorbance at 340 nm. 

L-Glutamate radio-labeled assay: 
5 The MurD enzyme activity in the presence or absence of putative inhibitors of 

MurD is also measurable using D- 14 C- glutamate as an endpoint assay. The reaction 
mixture contains D- 14 C- glutamate UDP-MurNAc-L-Ala, ATP, MgCI 2 , (NH 4 ) 2 S0 4 in 
100 mM Tris/HCI, pH 8.0. An HPLC assay with online UV and flow scintillation 
detects the formation of UDP-MurNAc-L-Ala-D- 14 C Glu and ADP in each reaction. 

10 

Example 27 

ALLOIOCOCCUS OTITIDIS ENCODED CELL WALL BIOSYNTHET IC ENZYME, MURE 

The fifth step in the cytoplasmic peptidoglycan biosynthetic is catalyzed by 

15 MurE. In this step, the monomer units in the Escherichia coli and Staphylococcus 
aureus cell wall peptidoglycans differ in the nature of the third amino acid in the L- 
alanyl-gamma-D-glutamyl-X-D-aianyl-D-aianine side chain, where X is meso- 
diaminopimelic acid or L-lysine, respectively. Therefore, MurE from E. coli\s the 
UDP-N-acetylmuramoyl-L-alanyl-D-glutamate: meso-diaminopimelic acid ligase, and 

20 MurE from S. aureus is the UDP-N-acetylmuramoyl-L-alanyl-D-glutamate: L-lysine 
ligase. Thus represents the major difference of MurE from other murein enzymes in 
cytoplasm. The amino acid residues catalyzed by MurE plays a key role in the 
integrity of sacculus since it is directly involved in the peptide cross-linkage. MurE 
reaction is also ATP-dependent, which supplies the energy needed for the peptide 

25 bond formation with concomitant generation of ADP and orthophosphate. 

The essentiality of MurE has been well documented in E. col'u in S. aureus, as 
well as other pathogens such as Haemophilis influenzae, Vibrio choierae and 
Corynebacterium glutamicum. Gene murE has been shown to be essential in 
bacteria. Homologue of this gene identified in Alloiococcus otitidis is described in 

30 Example 5/Table 4 (Seq. ID No 25). The protein encoded by the gene is set forth in 
Seq. ID No. 26. 

Alloiococcus otitidis MurE as a target for anti-infective development 



123- 



WO 03/104391 



PCT/US02/36122 



Alloiococcis otitidis ORF-851 , by sequence homology encodes enzyme UDP- 
N-acetylmuramyl-L-alanine-D-glutamate ligase: meso-diaminopimelic acid/or L- 
Lysine (MurE) (Seq. ID No 25). MurE activity in the presence or absence of a 
5 putative inhibitory molecule of MurE activity is used to identify novel antimicrobial I 
agents, which may be used ti treat disease caused by Alloiococcis otitidis. 

Assays for measuring MurE activity 

Radio labeled substrate assay: meso-A2pm-adding activity 

10 Activity of MurE from Alloiococcis otitidis in the presence or absence of a 

putative inhibitory molecule of MurE activity is measured by using radio-labeled 
meso- 14 C A2pm mixing with ATP, MgCI 2> UDP-MurNAc-L-Ala-D-Giu, DTT in 100 mM 
Tris/HCI and MurE from Alioiococcis otitidis . 

15 Radio labeled substrate assay: L-lysine adding activity 

Activity of MurE from Alloiococcis otitidis in the presence or absence of a 
putative inhibitory molecule of MurE activity is measured by using radio-labeled UDP- 
MurNAc-L-Ala-D-14C-Glu mixing with ATP, MgCk DTT, L-lysine in 100 mM Tris/HCI 
and MurE from Alloiococcis otitidis. 

20 In both cases, mixtures are incubated at 37°C for 30 min, and reactions 

stopped by the addition of acetic acid. Reaction product is separated by high votage 
electrophoresis in 2% formic acid for 45 min. The radio active spots corresponding to 
substrate and reaction product are detected by overnight autoradiography, or with 
radio scanner. The spots are also cut out and counted using liquid scintillation 

25 counter. 

Example 28 

Alloiococcus otitidis encoded cell wall biosynth etic enzyme, MurF 

The D-alanyl-D-alanine-adding enzyme MurF encoded by the murF gene 
30 catalyzes is the last step of the cytoplasmic peptidoglycan biosynthesis. MurF 

performs the ATP-dependent formation of UDP-N-acetylmuramyl-L-gamma-D-Glu- 
meso-diaminopimelyi-D-Ala-D-Ala (UDP-MurNAc-pentapeptide). The product of 
MurF, UDP-MurNAc pendapeptide, is the final product of the cytoplasm enzymes and 
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is the most important precusor for further peptidoglycan biosynthesis. UDP-MurNAc 
pendapeptide is then catalyzed by the plasma membrane bound enzymes such as 
the translocase MraY and transferase MurG. Homologue of this gene identified in 
Alloiococcus otiiidis is described in Example 5/Table 4 (Seq. ID No 3). The protein 
5 encoded by the gene is set forth in Seq. ID No. 4. 

Alloiococcus otitidis MurF as a target for anti-infective development 

Due to its high specificity, essentiality, and importance of its product UDP- 
MurNAc pentapeptide, MurF is attractive as an antibacterial target. The Alloiococcis 
10 otitidis ORF-48, by sequence homology,encodes enzyme UDP-N-acetylmuramyl-L- 
alanine-D-glutamate ligase: meso-diaminopimelic acid/or L-Lysine -alanyl-D-alanine- 
adding enzyme (MurF) (Seq. ID No. 3). MurF activity in the presence or absence of a 
putative inhibitory molecuie of MurF activity is used to identify novel antimicrobial 
agents, which may be used to treat disease caused by Alloiococcis otitidis. 



15 



Assays for measuring MurF activity 



Spectrophotometric assay detecting phosphate release: 

Activity of MurF from Alloiococcis otitidis in the presence or absence of a 
20 putative inhibitory molecule of MurF activity is detected by the inorganic phosphate 
release in the ATP dependent MurF reaction. This assay detects nonomole amount 
of Pi in the reaction mixture contains substrates ATP, D-ala-D-ala, UDP-MurNAc- 
tripeptide, DTT, MgCI 2 and MurF enzyme. After 5 minutes incubation, the reaction is 
quenched with the addition of malachite Green-ammonium molybdate for a colored 
25 reaction. 

Coupled spectrophotometric assay detecting formation of ADP 

Due to the conversion of ATP to ADP in MurF reaction, the production of ADP 
in the presence or absence of a. putative inhibitory molecule of MurF activity, is 
30 monitored with coupled enzymes of pyruvate kinase and lactase dehydrogenase 
spectrophotometrically. In this reaction, the decrease at 340 nm is observed as 
NADP is consumed in MurF reaction process. The reaction typically contains tris 
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buffer, substrates ATP, D-ala-D-ala, UDP-MurNAc-tripeptide, DTT, MgCI 2 , 
phosphoenopyruvate, NADPH and MurF en2yme. 

Example 29 

5 ALLOIOCOCCUS OTITlDiS ENCODED CELL WALL BIOSYNTHET IC ENZYME, MURG 

MurG, the last enzyme involved in the intracellular phase of peptidoglycan 
synthesis, is a membrane-associated glycosyltransferase. MurG catalyzes the 
transfer of /^acetyl glucosamine from UDP to the C4 hydroxyl of a lipid-linked N- 

10 acetyl muramic acid derivative (lipid I) to form lipid II. Lipid II is a linked disaccharide 
that is the minimal subunit of peptidoglycan. Once lipid II is formed, this disaccharide 
is translocated across the bacterial membrane where it is polymerized and cross- 
linked to form the peptidoglycan layers. MurG has been shown to be essential for 
bacterial survival. The inactivation of MurG gene rapidly inhibits peptidoglycan 

15 synthesis in exponential growing cells. As a result, various alterations of cell shape 

i 

are observed, and cell lysis finally occurs. Homologue of this gene identified in 
Alloiococcus otitidis is described in Example 5/T able 4 (Seq. ID No 87). The protein 
encoded by the gene is set forth in Seq. ID No. 88. 

20 Alloiococcus otitidis MurG as a target for anti-infective development 

MurG is shown to be associated with the inner face of cytoplasmic 
membrane, and establishing that the entire peptidoglycan monomer unit assembled 
before being transferred across the membrane. MurG is a key enzyme at the border 

25 line between cytoplasmic and membrane of pepdidoglycan synthesis, thus makes it 
an attractive target for novel antibacterial agent. Further, no mammalian analogues 
of MurG have been identified. Due to its high specificity, essentiality, and importance, 
MurG is attractive as an antibacterial target. 

The Alloiococcis otitidis ORF-2492 has been shown to encode, by sequence 

30 homology, glycosyltransferase (MurG) (Seq. ID No ). MurG activity in the 

presence or absence of a putative inhibitory molecule of MurG activity is used to 
identify novel antimicrobial agents, which may be used to treat disease caused by 
Ailoiococcis otitidis. 
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Assays for measuring MurG function 

Radiolabeled reaction 

Activity of MurG from Alloiococcis otitidis in the presence or absence of a 
putative inhibitory molecuie of MurG activity is measured by using 14 C labeled N- 
UDP-GluNAc in the reaction containing UDP-MurNAc-pentapeptide, MgCI 2 , ATP and 
MurG protein. The reaction is stopped after 30 min incubation and by boiling for 3 
min. The reaction mixtures are applied to a Whatman I filter paper and subject to 
descending chromatography overnight. Radioactivity is located and countered with a 
scanner. This assay is also used to identify the specificity of inhibitor of MraY or 
MurG, based on the detection of radiolabeled 14 C GluNAc incorporated into 
membrane precursors. 

Fluorometric assay 

Based on the decrease in NADPH fluorescence at 465 nm, MurG reaction is 

also monitored in a reaction mixture of HEPES buffer, MgCI 2> Triton, 
phosphoenolpyruvate, and coupled enzymes of lactic dehydrogenase and pyruvate 
kinase, UDP-GluNAc and synthesized lipid I analogue in the presence or absence of 
putative inhibitors of MurG activity. One micromolar UDP corresponds to 500- 
fluorescence unit under the instrument setting. 

Example 30 

A, I niOCOCCUS OTITIDIS ENCODF Q RY HMG CO A REDUCTASE (MVAA) 

Two pathways for isopentenyl diphosphate (IPP) synthesis have been 
described in bacteria: the mevalonate pathway and the non-mevalonate (MEP or 
GAP-pyruvate) pathway. The mevalonate pathway predominates in the 
archaebacteria, gram-positive organisms, yeast and mammals; whereas the MEP 
pathway is found in gram-negative organisms, B. subtilis, chlamydia, and 
) mycobacterium. The first HMG CoA reductase gene to be sequenced was cloned 
from P. mevalonii, in which HMG CoA reductase permits growth on mevalonate as a 
sole carbon source. A number of genes of the mevalonate pathway were identified in 
S. aureus, S, epidermidis, S. pyogenes, S. pneumoniae, E. faecalis and E. faecium. 
One of the genes, which encodes for HMG-CoA reductase {mvaA), when deleted 
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severely attenuated for virulence in a mouse model indicating that mvaA is essential. 
Due to its high specificity, essentiality, and importance, mvaA is attractive as an 
antibacterial target. Homologue of this gene identified in Alloiococcus otitidis is 
described in Example 5/Tabte 4 (Seq. ID No 37). The protein encoded by the gene is 
5 set forth in Seq. ID No. 38. 

HMG-CoA reductase (MvaA) as a target for anti-infective development 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
10 homology, HMG-CoA reductase {mvaA) (Seq. ID No 37). MvaA activity in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase 
{mvaA) activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis. 

15 Assays for measuring HMG-CoA reductase (mvaA) activity 

MvaA is purified by standard methods using widely available molecular tags 
following expression at high level from E. coli. Enzymatic activity is monitored in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase activity 
by following oxidation of NADPH to NADP spectrophotometrically at 340 nm. The 

20 assay is carried out in the following buffer: 0.25 mM NADPH, 0.25 mM HMG-CoA, 50 
mM NaCI, 1 mM EDTA, 5 mM DTT, 25 mM KH 2 P0 4 (pH 7.5). The assay is 
amenable to HTS in high density screening microtiter plates. 

25 Forward reaction: Activity of HMG-CoA reductase {mvaA) from Alloiococcus 

otitidis in the presence or absence of a putative inhibitory molecule of HMG-CoA 
reductase activity is measured by reductive deacylation of HMG-CoA to mevalonate 
as measured the consumption of NADPH to NADP. Unlike other class II HMG Coa 
reductases, MvaA from Alloiococcus otitidis, like S. aureus, can use either NADPH or 

30 NADH cofactor in the reaction. The following kinetic data describe the reaction: 
Km(HMG coa) = 40 pM, KmfNADPH) = 70 MM. K^adp) = 100 pM (12). This assay is 
inhibitable by the statin drug fluvastatin; the K, was measured at 320 pM, which is 
four orders of magnitude higher than the Kj for class I HMG-Coa reductases. 
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Reverse reaction: The oxidative acylation of mevalonate to HMG-CoA in the 
presence or absence of a putative inhibitory molecule of HMG-CoA reductase activity 
is also monitored. The following kinetic data describes the reaction: K^mevaionate) = 
670 pM, Km(coASH) = 390 pM, K^nadp) = 580 [JM (12). 

5 

Example 31 

ALLOIOCOCCUS otttidis encoded diphosphomevalonate decarb oxylase ( MVAD) 

Diphosphomevalonate decarboxylase, encoded by mvaD, the final enzyme 
10 acting in the mevalonate pathway of IPP synthesis was cloned from S. aureus by 
Wilding et al in 2000. Insertional inactivation of mvaD could only be accomplished 
when the strains were supplemented with mevalonate, indicating that mvaD is 
essential. The final step of the mevalonate pathway leading to IPP is the 
decarboxylation and dehydration of mevalonate-5-pyrophosphate to form isopentenyl 
15 diphosphate by MvaD (diphosphomevalonate decarboxylase). 

MvaD homologues are well represented in gram-positive organisms (10). 
Phyiogenetic analysis revealed that the cluster of gram-positive enzymes (39-80% 
identity) were well separated from the eukaryotic homologues, suggesting utility as 
an antibacterial target. The Alloiococcis otitidis ORF- 1275b has been shown to 
20 encode, by sequence homology, diphosphomevalonate decarboxylase (MvaD; (Seq. 
ID No. 43). MvaD activity in the presence or absence of a putative inhibitory molecule 
of diphosphomevalonate decarboxylase (MvaD; activity is used to identify novel 
antimicrobial agents, which may be used to treat the disease(s) caused by 
Altoiococcus otitidis: The protein encoded by the gene is set forth in Seq. ID No. 44. 

25 
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Example 32 

Al LOIOCOCCUS OTWDIS ENCODED HfflCi CoA SYNTHASE fMVAS] 



The second step of the mevalonate pathway leading to IPP is the irreversible 
5 condensation of acetoacetyl-CoA and acetyl-CoA to form HMG-CoA by MvaS (HMG 
CoA synthase). It has been shown that mvaS knockout mutant of S. pneumoniae 
was attenuated for virulence: Due to its high specificity, essentiality, and importance, 
mvaS is attractive as an antibacterial target. Homologue of this gene identified in 
Alloiococcus otitidis is described in Example 5/Table 4 (Seq. ID No 35). The protein 
10 encoded by the gene is set forth in Seq. ID No. 36. 

HMG COA SYNTHASE (MVAS) AS A TARGET FOR ANTI-INFECTIVE DEVELOPMENT 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
15 homology, MvaS (HMG CoA synthase) (Seq. ID No. 35). MvaS activity in the 

presence or absence of a putative inhibitory molecule of HMG-CoA synthase (mvaS) 
activity is used to identify novel antimicrobial agents, which may be used to treat 
disease caused by Alloiococcus otitidis. 

20 Assays for measuring MvaS function 

MvaS is purified by standard methods using widely available molecular tags 
following expression at high level from E. coli. HMG-CoA synthase activity in the 
presence or absence of a putative inhibitory molecule of HMG-CoA synthase {mvaS) 
is assayed by measuring the loss of the enolate form of acetoacetyl-CoA 

25 spectrophotometrically. The reaction is carried out in a buffer containing 50 mM Tris 
(pH 9.75), 5.0 mM MgCI 2 , 500 uM acetyl-CoA, 20 uM acetoacetyl-CoA and enzyme. 
The enolate formed is monitored at 302 nm; therefore, as the acetoacetyl-CoA is 
consumed the signal is depleted. Using this assay the following kinetic data is 
measured: ^^=350 uM; = 10 pM. This assay is amenable 

30 to HTS in high- high density screening microtiter plates. 
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Example 33 




5 Nicotinamide adenine dinucleotide (NAD) is an essential molecule in all living 

cells. NAD is synthesized via a multi-step de novo pathway or via a pyridine salvage 
pathway. The enzyme nicotinic acid mononucleotide adenylyl transferase (NaMN AT, 
EC2.7.7.18) catalyzes the conversion of ATP and nicotinic acid mononucleotide 
(NaMN) to nicotinic acid adenine dinucleotide (NaAD). The nadD gene, encoding 

10 bacterial NaMN AT, is essential for NAD biosynthesis and bacterial cell survival. 
NadD contains well-conserved the nucleotidyl transferase consensus sequence 
(GXFXXXHXGH). The adenylyl transferase encoded by the nadD gene prefers 
NaMN over nicotinomide mononucleotide (NMN) as substrate. Due to its high 
specificity, essentiality, and importance, nadD is attractive as an antibacterial target. 

15 Homologue of this gene identified in Alloiococcus otitidis is described in Example 
5/Table 4 (Seq. ID No 91). The protein encoded by the gene is set forth in Seq. ID 
No. 92. 

NICOTINAMIDE ADENINE DINUCLEOTIDE ADENYLYL TRANSFERASE (NADD) 
20 AS A TARGET FOR ANTI-INFECTIVE DEVELOPMENT 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, niotinomide adenine dinucleotide adenyl transferase (NadD) (Seq. ID No. 
91 ). NadD activity in the presence or absence of a putative inhibitory molecule of 
25 NadD activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis. 

Assays for measuring NadD function 
Discontinuous assay 

30 NadD activity in Alloiococcus otitidis is measured in the presence or 

absence of a putative inhibitory molecule of NadD activity. NadD converts 
nicotinic acid mononucleotide (NaMN) and adenosine triphosphate (ATP) to 
nicotinic acid dinucleotide (NaAD) and pyrophosphate (PPi). Each PPj 
molecule produced by the NadD reaction is then converted to two phosphate 

-131- 



WO 03/104391 



PCT/US02/36122 



(Pi) molecules in the presence of inorganic pyrophosphatase (PPase). The P, 
molecules present are quantitated with a malachite green reagent at 660 nm. 

HPLC-based assay: Enzyme activity is measured by HPLC quantitation 
of the reaction products. A neutralized aliquots from the reaction described 

5 above was injected into an HPLC system utilizing a 250 x4.6 mm Supelcosil 
LC-18 5um reversed-phase column. The elution conditions: 9 min at 100% 
buffer A (0.1 M potassium phosphate buffer, pH6.0,6 min at up to 1 2% buffer B 
(buffer a, containing 20% methanol, 2.5 min at up to 45% buffer B, 2.5 min at 
up to 100% buffer B, and hold at 100% buffer B for 5.5 min. The eluate 

10 absorbance was monitored at 254 nm. 

Continuous assay 

In bacteria, NadD combines nicotinic acid mononucleotide (NaMN) and 
adenosine triphosphate (ATP) to form nicotinic acid adenine dinucleotide (NaAD). 

15 NadE then converts NaAD into nicotinamide adenine dinucleotide (NAD) in the 

presence of ammonia and ATP. In the assay, the NAD product is reduced to NADH 
with alcohol dehydrogenase (ADH) and ethanol, thus permitting direct spectrometric 
detection of NADH at 340 nm wavelength. The coupled reaction above also includes 
inorganic pyrophosphatase (PPase) to prevent accumulation of the pyrophosphate 

20 byproduct from the consumption of ATP. 

Example 34 

Ai , mnnnCCUS OT777P/S ENCODED N1CQ T.NAM.de ADENINF niNUCLEOTIDE SYNTHASE 
(NAPE) 

NAD is a central compound in cellular metabolism. The final metabolic 
step in the pathway is conversion of nicotinamide adenine dinucleotide - product of 
NadD reaction - to NAD, a step catalyzed by the enzyme NAD synthetase (NadE). 
NaMN - substrate for NadD - can be formed by three different enzymatic reactions: 
30 in the de novo pathway from quinolinate, in Preiss-Handler salvage pathway from 
nicotinic acid, and in the nucleoside salvage pathway by deamindation of 
nicotinamide mononucleotide. In bacteria, there are no known alternatives for the 
metabolic steps between NaMN and NAD. Mutants blocked in these steps cannot be 
recovered as auxotrophs since the required metabolites are not taken up by cells. In 



25 
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the bacterial cells, the second substrate for NadE is ammonium, as opposed to 
glutamine for eukaryotes. NadE is an essential and conserved protein in the 
eubacterial nicotinamide adenine dinucleotide (NAD) biosynthesis pathway. 
Homologue of this gene identified in Alloiococcus otitidis is described in Example 
5/Table 4 (Seq. ID No 49). The protein encoded by the gene is set forth in Seq. ID 
No. 50. 

Assays for measuring NadE function: 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, niotinomide adenine dinucleotide adenyl synthase (NadE) (Seq. ID No. 
49). NadE activity in the presence or absence of a putative inhibitory molecule of 
NadE activity is used to identify novel antimicrobial agents, which may be used to 
treat disease caused by Alloiococcus otitidis. 

Discontinuous assay: 

In assay, NadE converts nicotinic acid adenine dinucleotide (NaAD) into 
nicotinamide adenine dinucleotide (NAD) in the presence of ammonia and ATP. 
Each PP, molecule produced by the NadE reaction can then be converted to 
two phosphate (Pi) molecules in the presence of inorganic pyrophosphatase 
(PPase). The P, molecules present can then be quantitated with a malachite 
green reagent at 660 nm. 

HPLC-based assay: 

Enzyme activity can be measured by HPLC quantitation of the reaction 
products. A neutralized aliquots from the reaction described above was injected 
into an HPLC system utilizing a 250 x4.6 mm Supelcosil LC-18 5fim reversed- 
phase column. The elution conditions: 9 min at 100% buffer A (0.1 M potassium 
phosphate buffer, pH6.0,6 min at up to 12% buffer B (buffer a, containing 20% 
methanol, 2.5 min at up to 45% buffer B, 2.5 min at up to 100% buffer B, and 
hold at 100% buffer B for 5.5 min. The eluate absorbance was monitored at 254 
nm (1). 
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Continuous assay: 

Coupled NadD-NadE assay. NadD and NadE can be detected in one 
continuous coupled assay. In first reaction, NadD combines nicotinic acid 
mononucleotide (NaMN) and adenosine triphosphate (ATP) to form nicotinic acid 
adenine dinucleotide (NaAD). NadE then converts NaAD into nicotinamide adenine 
dinucleotide (NAD) in the presence of ammonia and ATP. In the assay, the NAD 
product is reduced to NADH with alcohol dehydrogenase (ADH) and ethanol, thus 
permitting direct spectrometric detection of NADH at 340 nm wavelength. The 
coupled reaction above also includes inorganic pyrophosphatase (PPase) to prevent 
accumulation of the pyrophosphate byproduct from the consumption of ATP (this 
method can be use as HTS format). 

NadE assay. In assay, NadE converts NaAD into nicotinamide adenine 
dinucleotide (NAD) in the presence of ammonia and ATP. The NAD product is 
reduced to NADH with alcohol dehydrogenase (ADH) and ethanol, thus permitting 
direct spectrometric detection of NADH at 340 nm wavelength. The reaction above 
also includes inorganic pyrophosphatase (PPase) to prevent accumulation of the 
pyrophosphate byproduct from the consumption of ATP (this method can be use as 
HTS format). 

Example 35 

Alloiococcus otitidis encoded putative mem brane protein NorA 

An efflux transporter NorA that was originally identified in Staphylococcus 
aureus belongs to the family of multidrug resistance (MDR) transporters. NorA is 
encoded by chromosomally-located norA gene, it has broad substrate specificity and 
mediates resistance to various lipophilic and monocationic compounds such as 
ethidium bromide (EtBr), cetrimide, benzalkonium chloride, rhodamine 6G, 
tetraphenylphosphonium (TPP), chloramphenicol as well as some hygrophilic 
quinolones such as norfloxacin, ciprofloxacin and oxafloxacin. Increased levels of 
norA expression are associated with single nucleotide changes upstream of norA in a 
putative promoter/operator region and lead to increased pleiotropic resistance. NorA 
is a putative membrane protein with 1 2 predicted membrane-spanning domains and 
is classified as a member of major facilitator superfamily (MFS), a subgroup of MDR 
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transporters characterized by the presence of 12-14 transmembrane segments and 
the use of proton motive force as an energy source for drug efflux. NorA homologs 
that belong to MFS family include Bmr and Bit of Bacillus sufctilis, EmeA of 
Enterococcus faecalis and PmrA of Streptococcus pneumonia. The expression of 
5 bmr gene in B. subtiiis is upregulated by the product of adjacent bmR gene in the 
presence of inducers (rhodamine 6G and TPP), and there is an evidence that 
expression of norA in S. aureus is regulated by AlrS-AIrR two-component regulatory 
system. 

- It remains unknown whether the efflux of various toxins is a primary function 
10 of NorA. When overexpressed in E. co!i, norA produces resistance to a broad range 
of substrates including fluoroquinolones. Everted membrane vesicles prepared from 
nor/l-expressing E. co!i exhibit energy-dependent transport of norfloxacin, the 
transfer is abolished by cyanide m-chlorophenylhydrazone (CCCP) and nigericin but 
not by valinomycin indicating that NorA-mediated transfer is coupled to the proton 
15 gradient of cell membrane. Norfloxacin uptake in everted vesicles as well as NorA- 
associated resistance phenotype is inhibited by reserpine and verapamil that also 
inhibit other MDR transporters and are toxic to mammalian cells. Histidine-tagged 
NorA (NorA-His) was recently overexpressed and purified from E. coli t reconstituted 
into both everted membrane vesicles and proteoliposomes and was shown to 
20 function as a self-sufficient efflux pump using fluorescent dye Hoechst 33342. Due to 
its high specificity, essentiality, and importance, norA is attractive as an antibacterial 
target. Homologue of this gene identified in Alloiococcus otitidis is described in 
Example 5/Table 4 (Seq. ID No 67). The protein encoded by the gene is set forth in 
Seq. ID No. 68. 

25 

NorA as a target for anti-infective development 

The Alloiococcis otitidis ORF- has been shown to encode, by sequence 
homology, NorA (Seq. ID No. 67). NorA activity in the presence or absence of a 
30 putative inhibitory molecule of NorA activity is used to identify novel antimicrobial 

agents, which may be used to treat disease caused by Alloiococcus otitidis.. Because 
of broad substrate specificity of NorA, NorA inhibitors should be particularly useful 
against pathogens that possess multiple drug resistance. 
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Whole-cell high-throughput screen (HTS) assay that measures NorA activity 
in the presence or absence of a putative inhibitory molecule of Alloiococcis otitidis 
NorA activity is used to identify potential inhibitors of NorA activity. The assay utilizes 
B. subtilis strain (aaNA) that has both Bmr and Bit genetically inactivated while 
5 Alloiococcis otitidis NorA is supplied on the plasmid expression vector. The screen is 
based on the reversing of the resistance of aaNA to EtBr. The exponentially growing 
cells are inoculated into the wells of a 96-well plate to OD 6 oo=0.001 , the compounds 
are added at 20 pg/ml and EtBr is added at 10 pg/ml. Plates are incubated for 18 hrs 
at 37°C and examined for growth. Compounds that inhibit growth are subsequently 

10 tested in the presence/absence of EtBr for toxicity and effectivrty. The efflux of EtBr 
from cells is monitored as described previously. The exponentially growing cells are 
loaded with EtBr at a concentration of 10 Dg/ml for 20 min at 37°C in the presence of 
reserpine (20 Dg/ml). Cells are centrifuged, resuspended to an OD 600 =0.2 in a 
minimal medium GM1 alone or in the presence of inhibitor compound. Fluorescence 

15 of EtBr is monitored on a fluorimeter at an excitation □ of 530 nm and emission □ of 
600 nm.. 

Monitoring of Hoechst 33342 efflux 

The efflux of fluorescent dye Hoechst 33342 from either everted membrane 
20 vesicles prepared from Alloiococcus otitidis His-NorA overexpressing E. coli or a 
proteoliposomes reconstituted with Alloiococcus otitidis His-NorA is also used to 
monitor NorA activity in the presence or absence of putative inhibitors of NorA. 
Everted membrane vesicles are diluted into 2 ml of 50 mM potassium HEPES (pH 
7.2), 8.5 mM NaCI, 2 mM magnesium sulfate at a final protein concentration of 40 
25 pg/ml. NorA is activated by the addition of either 0.5 mM lactate or 0.1 mM Mg 2+ - 
ATP. Hoechst 33342 is used in a range of 12.5 to 200 nM. Inhibitors are added at 
various concentrations prior to the addition of Hoechst 33342. Fluorescence change 
is monitored at excitation and emission wavelenghths of 355 and 457 nm 
respectively in a FluoroMax spectrofluorimeter. For proteoliposome assay, the His- 
30 NorA proteoliposomes are diluted into a cuvette containing 2 ml of 20 mM potassium 
phosphate, 50 mM potassium sulfate, 2 mM magnesium sulfate (pH 7.0) at a protein 
concentration of 10 pg/ml. The inhibitor compounds and Hoechst 33342 are added at 
various concentrations and the fluorescence is measured as described previously. 
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Example 36 

ALLOIOCOCCUS OTtTIDIS ENCODED OBG GTPASE 

5 The obg gene is the second gene in a two-gene operon along with the stage- 

O sporulation gene spoOB in B. subtilis. SpoOB is central to the phospho-relay 
signal cascade that initiates sporulation. Obg is a member of the GTPase 
superfamily by virtue of homology throughout a small portion of the protein that in 
other members of the family is responsible for nucleotide (GTP/GDP) binding. Obg 

10 is essential for growth. Initiation of sporulation is thought to be triggered by changes 
in the GTP content of the cell; therefore, the presence of a GTP binding protein in an 
operon with a central player in the process is suggestive of a role for Obg in sensing 
GTP levels and transmitting a signal to SpoOB. 

It has been shown that Obg is involved in activation of the a 8 transcription 

15 factor in B. subtilis in response to environmental stress. Cells were depleted of Obg 
utilizing a construct that put obg under the control of an inducible (P lac ) promoter. 
Depletion of IPTG resulted in bacteria that failed to activate a 8 . These studies further 
showed by yeast-two-hybrid analysis that Obg interacted with several known a 8 
regulators, the so-called Rsb proteins. 

20 The role Obg plays in transmitting signals important for sporulation and 

activation of the stress sigma factor may be indicative of the activities that small GTP 
binding proteins carry out in triggering cell division in response to GTP levels. Due to 
its high specificity, essentiality, and importance, obg is attractive as an antibacterial 
target. Homologue of this gene identified in Alloiococcus otitidis is described in 

25 Example 5/Table 4 (Seq. ID No 71 ). The protein encoded by the gene is set forth in 
Seq. ID No. 72. 

Obg as a target for anti-infective development 

30 Obg is essential for bacterial viability. Conditional lethal alleles revealed that 

Obg is required for early events in sporulation and is involved in transmitting signals 
require for activation of the stress sigma factor. The Alloiococcis otitidis ORF- has 
been shown to encode, by sequence homology, obg (Seq. ID No.71). Obg activity in 
the presence or absence of a putative inhibitory molecule of Obg activity is used to 
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identify novel antimicrobial agents, which may be used to treat disease caused by 
Alloiococcus otitidis.. 

Nucleotide binding 

Obg binding to nucleotide in the presence or absence of putative 
antimicrobials, which inhibit Obg activity, is monitored by a simple filter-binding 
assay. Alloiococcus otitidis Obg (1-5 ug) is incubated with c^P-GTP (0.2 uCi) in a 
buffer consisting of 50 mM Tris (pH 8.5), 1 .5 mM MgCI 2 , 0.1 mM EDTA, 200 mM KCI, 
.1 0% glycerol for 30 minutes to 3 hours at 37*C. A portion of the reaction mix is 
spotted on nitrocellulose membrane, washed (50 mM Tris (pH 8.5), 1 .5 mM MgCI 2 , 1 
mM DTT) and dried. The membrane is then exposed to X-ray film. Alternatively, the 
spots are excised and counted. This assay is directly amenable to HTS using filter 
plates. 

i GTPase activity 

The GTP hydrolytic activity of Obg is monitored using thin-layer 
chromatography (1 , 2, 10). Obg and cr^P-GTP are incubated in 50 mM Tris (pH 8.5), 
1 .55 mM MgCI 2 , 0.1 mM EDTA, 200 mM KCI, 10% glycerol for 30 minutes at 37°C. 
An aliquot of the reaction is placed on PEI cellulose and the strip developed with 0.5 
20 M KH 2 P0 4 , 1 .0 M NaCI (pH 3.7). The spots conforming to GDP and GTP are 
identified by UV shadowing, excised and counted. . 

Alternatively, the hydrolysis of v^P-GTP is monitored by assaying for 
liberated P, (12). Obg and o^P-GTP are incubated in 50 mM Tris (pH 8.5), 1 .5 mM 
MgCI 2 , 0.1 mM EDTA. 100 mM KCI, 10% glycerol for 30 minutes to 3 hours at 37°C. 
25 The reaction is stopped by the addition of a slurry of charcoal in 1 mM Kpi (pH 7.5). 
which selectively binds the GTP and GDP. The liberated P, in the supernatant is 
monitored by Cerenkov counting. Free P, is also monitored with the Malachite Green 
reagent. 

30 Autophosphorylation 

Obg autophosphorylation is monitored by incubating Obg with v^P-GTP in 50 
mM Tris (pH 8.5), 1.5 mM MgCI 2 , 0.1 mM EDTA, 100 mM KCI, 10% glycerol for 30 
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minutes at 37°C. Samples are analyzed following separation on SDS polyacrylamide 
gels, drying the gel and exposure to film. 

Example 37 

5 RPOA, RPOB. RPOC, AND RPOP. THE GENES ENCODING THE SUBUNIT S COMPRISING 

Alloiococcus otitidis RNA Polymerase: alpha, beta, beta', an d sigm a. 

RNA polymerase is an enzyme comprised of multiple highly conserved 
subunits which catalyzes the DNA template directed polymerization of ribonucleic 

10 nucleotides into ribonucleic acid. It is composed of a core enzyme, □2,D,D , J along 
with a fifth subunit present in stoichiometric amounts, □□□which can catalyze RNA 
synthesis non-specifically. Holoenzyme is formed by the introduction of the subunit 
□ which enhances gene promoter recognition and allows specificity. Homoiogs of 
the genes identified in Alloiococcus otitidis are described in Example 5/Table 4 (Seq. 

15 ID Nos 7, 9, 11, and 1 3). The amino acid sequence of the protein encoded by these 
genes are set forth in Seq. ID Nos. 8, 10, 12 and 14. 

Functions for the individual subunits have been defined biochemically, and 
interactions between them have now been deduced structurally by crystallographic 
analysis of the enzyme from Thermatoga thermophila, and to a lesser extent, 

20 Escherichia coll The alpha subunit, encoded by rpoA, is required for enzyme 

assembly. It also interacts with transcription factors and with DNA elements involved 
in enhanced promoter strength. Beta, encoded by rpoB, is involved in initiation and 
elongation of the polymerization product. Beta' (encoded by rpoC), is responsible for 
binding of the enzyme to the DNA template. Omega is required to restore denatured 

25 RNA polymerase to function in vitro. Finally, sigma, encoded by rpoD, directs the 
enzyme to promoters on the template to enhance specificity of transcription 
(polymerization). 
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Alloiococcus otitidis RNA Polymerase: alpha, beta, beta', and sigma as a 

TARGET FOR ANTI-INFECTIVE DEVELOPMENT 

Bacterial RNA polymerase is a validated target for antimicrobial 
chemotherapy in that several inhibitors have been identified and at least one, 

5 rifampin, is in use clinically. Alloiococcus otitidis RNA polymerase holoenzyme is 
essential for bacterial viability. The Alioiococcis otitidis ORFs- have been shown to 
encode, by sequence homology, RNA polymerase holoenzyme (Seq. ID Nos. 7, 9, 
1 1 and 13). Alloiococcus otitidis RNA Polymerase activity in the presence or absence 
of a putative inhibitory molecule of Alloiococcus otitidis RNA Polymerase activity is 

10 used to identify novel antimicrobial agents, which may be used to treat disease 
caused by Alloiococcus otitidis. 

Assays for the activity of RNA polymerase 

Genes encoding the subunits of Alloiococcus otitidis RNA polymerase can be 

15 obtained using polymerase chain reaction amplification of the genomic region 

encoding them. The genes are subcloned into a standard expression vector either 
containing an amino acid tag for ease of purification or not. The enzyme are 
overexpressed in Escherichia coli and purified using a standard tag system or 
conventional chromatography . 

20 Because RNA polymerase catalyzes the incorporation of single ribonucleotides 

into RNA, the incorporation of radiolabeled nucleotides into larger oligonucleotides is 
monitored to measure activity of the enzyme in the presence or absence of putative 
inhibitors of RNA polymerase activity. An automated high throughput filtration assay 
has been previously described for E. coli polymerase which uses filterplates 

25 containing a hydrophobic membrane and DEAE beads to capture polymerized RNA. 
G-less supercoiled DNA is used as a template at 6 ug/ml. Reaction contained 0.5 
mM ATP, 0.1 mM UTP, 0.3 mM CTP, approximately 100,000 counts per minute (per 
100 ul) [y-^PJ CTP (2000 Ci/mmol, NEN/DuPont), 4 % polyethylene glycol, 4 mM 
DTT, 10 mM MgCI 2 , in 50 mM Tris-acetate (pH 7.8), and 100 mM potassium acetate. 

30 The reaction is carried out at 34 degrees C for 40 minutes, with 10% DMSO present 
in all reactions. The reaction was stopped by adding 100 ul 15% DEAE-Sephacel 
bead slurry in 50% methanol, 20 mM EDTA, and 0.02% NP-40. The reaction was 
incubated for 40-60 minutes at room temperature without shaking, and then 
transferred to a unifilter plate on a filtermate cell harvester. The wells were washed 
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six times with 2X PBS and 0.1% NP-40. After washing the bottom of the plate was 
sealed, and 50 ui scintillation counting liquid was added. Radioactivity was counted 
using a microplate scintillation counter. 

Deconvolution assays are carried out by measuring the inhibition of sigma 

5 activity. Because sigma is required only for promoter specificity, polymerization may 
occur non-specifically if sigma is inhibited. Consequently a second assay is 
described above that is used to deconvolute activity against sigma. 

The binding of putative inhibitory compounds to core enzyme. Several 
techniques are utilized to determine the interaction of inhibitors with individual 

10 subunits and include nuclear magnetic resonance and capillary electrophoresis. 

Example 38 

YPHC, encoding a small GTPase of unknown function from Alloiococcus 

otitidis 

15 

The yphC was initially identified in Bacillus subtilis in a collaboration between 
Wyeth and Millennium pharmaceuticals as being essential for growth by insertional 
mutagenesis. Subsequently it was determined that YphC, the encoded protein, 
contained two GTPase domains and had some homology to era. It was further 

20 identified in Thermatoga maritima and Escherichia coli . While no function has yet 
been determined for yphC, it appears that the carboxy terminal may contain an RNA 
binding site. In addition, site directed mutagenesis of four amino acids in the carboxy 
region were found to be lethal (unpublished results, Millennium). Under non- 
permissive conditions, strains carrying temperature sensitive alleles of the gene in E. 

25 coli become elongated, and chromosome segregation becomes abberrant, 

suggesting a role in cell division. Homologue of this gene identified in Alloiococcus 
otitidis is described in Example 5>Table 4 (Seq. ID No 73). The protein encoded by 
the gene is set forth in Seq. ID No. 74. 
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YphC from Alloiococcus otitidis as a target for antimicrobial chemotherapy 

YphC is an essential protein in Bacillus subtilis and E. coli, and is conserved 
among bacteria including Alloiococcus otitidis. The Alloiococcis otitidis ORF- has 

5 been shown to encode, by sequence homology, YphC (Seq. ID No. 73). YphC 

activity in the presence or absence of a putative inhibitory molecule of YphC activity 
is used to identify novel antimicrobial agents, which may be used to treat disease 
caused by Alloiococcus otitidis.. Consequently it is proposed here that an assay 
which identified inhibitors of YphC from Alloiococcus would result in small molecules 

10 which can be developed into effect antimcrobial agents. Additionally, because of the 
conservation of the enzyme among bacteria, inhibitors of the protein's function from 
this organism should have broad spectrum activity. 

Assays for the GTP hydrolysis by YphC 

15 The YphC gene from Alloiococcus otitidis is obtained using polymerase chain 

reaction amplification of the genomic region encoding it. The gene is subcloned into 
a standard expression vector either containing an amino acid tag for ease of 
purification or not. The enzyme is then overexpressed in Escherichia coli and 
purified using a standard tag system or conventional chromatography. Activity of 

20 YphC in the presence or absence putative antimicrobial agents is monitored using 
the assay system described below. 

GTP hydrolysis - detection by thin layer chromatography: Reaction is 
carried out in a 50 ul reaction of 50 mM Tris-CI (pH 7.5), 400 mM KCI, 5 mM MgCI2, 
25 1 mM DTT, 1 0 uM [a-32P] GTP, and 1 0 ug purified YphC, at 37 degrees for 1 0 

minutes. The reaction is terminated by transfer of 5 ul samples to 10 ul of ice-cold 20 
mM EDTA. Portions are spotted onto polyethyleneimine-cellulose thin layer 
chromatography plates, which are developed in 0.75 KH2P04 (pH 3.65). The plate 
is autoradiographed to identify hydrolysis products. 
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WHAT IS CLAIMED IS: 



5 1. A purified or isolated Alloiococcus otitidis nucleic acid sequence comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, wherein expression of said nucleic acid 
is essential for the proliferation of a cell. 

10 2. A purified or isolated nucleic acid of Alloiococcus otitidis comprising a 

fragment of one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105 said fragment selected from the group consisting of 
fragments comprising at least 10, at least 20, at least 25, at least 30, at least 
50 and more than 50 consecutive nucleotides of one of one of odd numbered 

15 sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

* 

3. A purified or isolated antisense nucleic acid comprising a nucleotide 

sequence complementary to at least a portion of an intragenic sequence, 
intergenic sequence, sequences spanning at least a portion of two or more 
20 genes, 5* noncoding region, or 3' noncoding region within an operon 

comprising a proliferation-required gene of Alloiococcus otitidis whose activity 
or expression is inhibited by an antisense nucleic acid and selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 

25 4. A purified or isolated nucleic acid comprising a nucleotide sequence having at 
least 70% identity to a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 
fragments comprising at least 25 consecutive nucleotides selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 

30 the nucleotide sequences complementary to one of odd numbered sequences 

set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, and the sequences 
complementary to fragments comprising at least 25 consecutive nucleotides 
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of one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID 
Nos: 105. 

5. A vector comprising a promoter operably linked to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence of any one of odd numbered sequences 
set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 



10 5. A purified or isolated polypeptide of Alloiococcus otitidis comprising a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence of one of odd numbered sequences set 
forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a fragment selected from the 
group consisting of fragments comprising at least 5, at least 10, at least 20, at 

15 least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 

amino acids of one of the said polypeptides. 

6. A purified or isolated Alloiococcus otitidis polypeptide comprising a amino 
acid sequence having at least 25% amino acid identity to a polypeptide 

20 whose expression is inhibited by a nucleic acid comprising a nucleotide 

sequence selected from one of odd numbered sequences set forth in Seq. ID 
Nos: 1 to Seq. ID Nos: 105, or at least 25% amino acid identity to a fragment 
comprising at least 10, at least 20, at least 30, at least 40, at least 50, at least 
60 or more than 60 consecutive amino acids of a polypeptide whose 

25 expression is inhibited by a nucleic acid comprising a nucleotide sequence 

selected from the group consisting of one of odd numbered sequences set 
forth in Seq. ID Nos: 1 to Seq. ID Nos: 105. 



30 7. A purified or isolated Alloiococcus otitidis polypeptide comprising selected 

from one of the even numbered sequences set forth in Seq. ID Nos: 2 to Seq. 
ID Nos: 106, wherein the polypeptide is essential for the proliferation of a cell. 
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8. A method of producing an Alloiococcus otitidis polypeptide comprising 

introducing into a cell a vector comprising a promoter operably linked to a 
nucleic acid comprising a nucleotide sequence encoding a polypeptide whose 
5 expression is essential for the proliferation and viability of Alloiococcus 

otitidis, and which is inhibited by an antisense nucleic acid, and which is 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105. 



10 

9. A method of inhibiting the proliferation of Alloiococcus otitidis in an individual 
comprising inhibiting the activity or reducing the amount of a gene product 
whose expression is inhibited by an antisense nucleic acid comprising a 
nucleotide sequence selected from one of odd numbered sequences set forth 
15 in Seq. ID Nos: 1 to Seq. ID Nos: 105 or inhibiting the activity or reducing the 

amount of a nucleic acid encoding said gene product. 



10. A method for identifying a compound which influences the activity of an 
20 Alloiococcus otitidis gene product , which is required for proliferation, said 

gene product comprising a gene product whose expression is inhibited by an 
antisense nucleic acid comprising a nucleotide sequence selected from one 
of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, 
said method comprising: 

25 

(a) contacting said gene product with a candidate compound; and 

(b) determining whether said compound influences the activity of said 
gene product. 

30 11. A method for identifying a compound or an antisense nucleic acid having the 
ability to reduce activity or level of a Alloiococcus otitidis gene product, which 
is required for proliferation, said gene product comprising a gene product 
whose activity or expression is inhibited by an antisense nucleic acid 
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comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, said method 

comprising the steps of: 

(a) contacting a target gene or RNA encoding said gene product with 
a candidate compound or antisense nucleic acid; and 

(b) measuring the activity of said target. 

13. A method for inhibiting cellular proliferation of Alloiococcus otitidis comprising 
introducing an effective amount of a compound with activity against a gene 
whose activity or expression is essential for cellular proliferation, and which is 
inhibited by an antisense nucleic acid comprising a nucleotide sequence 
selected from one of odd numbered sequences set forth in Seq. ID Nos: 1 to 
Seq. ID Nos: 105, or a compound with activity against the product of said 
gene into a population of Alloiococcus otitidis cells expressing said gene. 

1 3. A composition comprising an effective concentration of an antisense nucleic 
acid comprising a nucleotide sequence selected from one of odd numbered 
sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 105, or a proliferation- 
inhibiting portion thereof in a pharmaceutical^ acceptable carrier. 



A method for identifying a compound having the ability to inhibit proliferation 

of Alloiococcus otitidis cell comprising: 

(a) identifying a homologue of a gene or gene product whose activity 
or level is inhibited by a nucleic acid comprising a nucleotide 
sequence selected from one of odd numbered sequences set forth 
in Seq. ID Nos: 1 to Seq. ID Nos: 105, in a test cell, wherein said 
test cell is not Alloiococcus otitidis; 

(a) identifying an inhibitory nucleic acid sequence which inhibits the 
activity of said homologue in said test cell; 

(b) contacting said test cell with a sublethal level of said inhibitory 
nucleic acid, thus sensitizing said cell; 

(c) contacting the sensitized cell of step (c) with a compound; and 
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(d) determining the degree to which said compound inhibits 

proliferation of said sensitized cell relative to a cell which does not 
contain said inhibitory nucleic acid. 

5 16. A method for identifying a compound having activity against a biological 
pathway required for proliferation comprising: 

(a) sensitizing a cell by providing a sublethal level of an antisense 
nucleic acid complementary to a nucleic acid encoding a gene 
product required for proliferation, wherein the activity or expression 

10 of said gene product is inhibited by an antisense nucleic acid 

comprising a nucleotide sequence selected from one of odd 
numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 
105, in said cell to reduce the activity or amount of said gene 
product; 

15 (a) contacting the sensitized cell with a compound; and 

(b) determining the degree to which said compound inhibits the 
growth of said sensitized cell relative to a cell which does not 
contain said antisense nucleic acid. 



20 17. A method for identifying a compound having the ability to inhibit one of the 

Alloiococcus otitidis polypeptides encoded by a polynucleotide selected from 
one of odd numbered sequences set forth in Seq. ID Nos: 1 to Seq. ID Nos: 
105, and which is essential for cellular proliferation comprising: 

(a) contacting a cell which expresses the polypeptide with the 
25 compound; and 

(b) determining whether said compound reduces proliferation of said 
contacted cell by acting on said gene product. 

18. A method for identifying a compound having the ability to inhibit one of the 
30 purified and isolated Alloiococcus otitidis polypeptides selected from one of 

the even numbered sequences set forth in Seq. ID No.: 2 to Seq. ID No.: 106, 
and which is essential for cellular proliferation comprising: 
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(c) contacting the purified and isolated polypeptide with the compound 
in vitro in the presence or absence of a substrate, which is 
essential for the activity of the polypeptide; and 

(d) determining the effect of the compound on the polypeptide by 
measuring the effect of the polypeptide on the substrate. 



19. A compound which interacts with an Alloiococcus otitidis polypeptide selected 
from one of the even numbered sequences set forth in Seq. ID No.: 2 to Seq. 

10 ID No.: 106 and inhibits its activity. 

20. A method for manufacturing an antimicrobial compound comprising the steps 
of screening one or more candidate compounds to identify a compound that 
reduces the activity or level of an Alloiococcus otitidis polypeptide selected 

15 from one of the even numbered sequences set forth in Seq. ID No.: 2 to Seq. 

ID No.: 106, said polypeptide comprising a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a nucleotide 
sequence selected from one of the odd numbered sequences set forth in Seq. 
ID No.: 1 to Seq. ID No. 105; and manufacturing the compound so identified. 

20 

21 . A compound which inhibits proliferation of Alloiococcus otitidis by interacting 
with a gene encoding a polypeptide that is required for proliferation or with a 
polypeptide required for proliferation, wherein said polypeptide is selected 
from the group consisting of a gene product having at least 70% nucleotide 

25 sequence identity from one of the odd numbered sequences set forth in Seq. 

ID No.: 1 to Seq. ID No. 105, polypeptide encoded by a nucleic acid having at 
least 70% nucleotide sequence identity to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid 
comprising a nucleotide sequence selected from one of the odd numbered 

30 sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105, a polypeptide 

having at least 25% amino acid identity to a gene product whose expression 
is inhibited by an antisense nucleic acid comprising a nucleotide sequence 
selected one of the odd numbered sequences set forth in Seq. ID No.: 1 to 
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Seq. ID No. 105, a polypeptide encoded by a nucleic acid comprising a 
nucleotide sequence which hybridizes to a nucleic acid selected from one of 
the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. ID No. 105 
under stringent conditions, a gene product encoded by a nucleic acid 

5 comprising a nucleotide sequence which hybridizes to a nucleic acid selected 

from one of the odd numbered sequences set forth in Seq. ID No.: 1 to Seq. 
ID No. 105 under moderate conditions, and a gene product whose activity 
may be complemented by the gene product whose activity is inhibited by a 
nucleic acid selected from one of the odd numbered sequences set forth in 

10 Seq. ID No.: 1 to Seq. ID No. 105. 
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SEQUENCE LISTING 

<110> American Cyanamid Company, and Murphy, Ellen and Pro j an, Stephen, j . 

<120> Alloiococcus otitidis Infectious Disease Targets 

* 

<13 0> Application 1 
<160> 106 

<170> Patentln version 3.1 

<210> 1 
<211> 42S 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (73) . . (426) 

<223> 

<400> 1 

aagacaaaaa agaagaggga aaagatctta agacacttcc ctaagtctga acatattcta 60 



ggagggttac aa gtg att aca gga atg ggt gtg gat att gtt gaa atg age 111 

Met lie Thr Gly Met Gly Val Asp lie Val Glu Met Ser 
15 10 

egg att caa get gtt tgg gac cga aag ccc age ttt gec cag egg att 159 
Arg lie Gin Ala Val Trp Asp Arg Lys Pro Ser Phe Ala Gin Arg lie \ 
15 20 25 

tta acc caa agg gag ttg get tat ttc gag aaa gcg act ggt agg egg 207 
Leu Thr Gin Arg Glu Leu Ala Tyr Phe Glu Lys Ala Thr Gly Arg Arg 
30 35 40 45 

aga att gaa ttc eta gcg gga egg ttt gee ggt aaa gaa get tac agt 255 
Arg lie Glu Phe Leu Ala Gly Arg Phe Ala Gly Lys Glu Ala Tyr Ser 

50 55 60 

aaa gee ttg gga act ggt att gga cgc ttg age ttt aaa gat att gaa 303 
Lys Ala Leu Gly Thr Gly lie Gly Arg Leu Ser Phe Lys Asp He Glu 

65 70 75 

ate eta ate aat gac caa ggc cag cca gtc eta aca tct cat cct aaa 351 
He Leu He Asn Asp Gin Gly Gin Pro Val Leu Thr Ser His Pro Lys 
80 85 90 

get ggc egg gee ttg att tea att tct cac act aga gac etc tgc ctg 3 99 

Ala Gly Arg Ala Leu He Ser He Ser His Thr Arg Asp Leu Cys Leu 
95 100 105 



gee cag gtc ctt tta cag gaa aat tga 
Ala Gin Val Leu Leu Gin Glu Asn 
110 115 



426 



WO 03/104391 



2/235 



PCT/US02/36122 



<210> 2 
<211> 117 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 2 

Met He Thr Gly Met Gly Val Asp He Val Glu Met Ser Arg He Gin 
1 5 10 15 

Ala Val Trp Asp Arg Lys Pro Ser Phe Ala Gin Arg He Leu Thr Gin 

20 25 30 

Arg Glu Leu Ala Tyr Phe Glu Lys Ala Thr Gly Arg Arg Arg He Glu 
35 40 45 

Phe Leu Ala Gly Arg Phe Ala Gly Lys Glu Ala Tyr Ser Lys Ala Leu 
50 55 60 

Gly Thr Gly He Gly Arg Leu Ser Phe Lys Asp He Glu He Leu He 
65 70 75 80 

Asn Asp Gin Gly Gin Pro Val Leu Thr Ser His Pro Lys Ala Gly Arg 

85 90 95 



Ala Leu He Ser He Ser His Thr Arg Asp Leu Cys Leu Ala Gin Val 

100 105 HO 



Leu Leu Gin Glu Asn 
115 



<210> 3 
<211> 1410 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (16) . . (1410) 
<223> 

<400> 3 

ataggagtca taacg gtg tct tgg aaa tta aaa gag att gcc cag gca gtt . 51 

Met Ser Trp Lys Leu Lys Glu He Ala Gin Ala Val 
1 5 10 



ggg gga gag eta gtt agt gcg gac ggc cag gag gag gtc acc ggg gtc 
Gly Gly Glu Leu Val Ser Ala Asp Gly Gin Glu Glu Val Thr Gly Val 



99 
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15 20 25 

cac ttt gat tea agg cga ctt gaa cca ggt gac ttg ttt gtt cct att 147 

His Phe Asp Ser Arg Arg Leu Glu Pro Gly Asp Leu Phe Val Pro lie 
30 35 40 

tta ggc cag egg gat ggt cat gat ttt gee caa gee gee eta gac caa 195 

Leu Gly Gin Arg Asp Gly His Asp Phe Ala Gin Ala Ala Leu Asp Gin 

45 50 55 60 



gga get age gga gee ttt tgg gec aaa gat tea age tta gee cct aaa 
Gly Ala Ser Gly Ala Phe Trp Ala Lys Asp Ser Ser Leu Ala Pro Lys 

65 70 75 



aac ggg gat gaa ccc ctg ctt gag tct gee ttg aac cac cac ccc cac 
Asn Gly Asp Glu Pro Leu Leu Glu Ser Ala Leu Asn His His Pro His 

225 230 235 



243 



ggt ctt ccc ttg ate aag gta gaa gat age tac cag gee eta gtt gac 291 
Gly Leu Pro Leu lie Lys Val Glu Asp Ser Tyr Gin Ala Leu Val Asp 

80 85 9 0 

ctg gee aag tgg cat ctt gaa get gtc gca cct atg aaa att gee ate 339 
Leu Ala Lys Trp His Leu Glu Ala Val Ala Pro Met Lys He Ala He 
95 100 105 

acc ggc agt aat ggg aag ace act act aag gac atg gtg get agt gtg 3 87 

Thr Gly Ser Asn Gly Lys Thr Thr Thr Lys Asp Met Val Ala Ser Val 
110 115 120 

gtg ggc caa gca ttt aag tgt cac aaa aca gtt age aac tta aat aat 435 
Val Gly Gin Ala Phe Lys Cys His Lys Thr Val Ser Asn Leu Asn Asn 
125 130 135 140 

gaa ctt ggc gtg ccc atg act ate tta get atg cct gca gac tgc cag 483 
Glu Leu Gly Val Pro Met Thr He Leu Ala Met Pro Ala Asp Cys Gin 

145 150 155 

gtc ata gtt gtt gaa atg ggc atg gat gga cca ggt cag ate teg gee 531 
Val He Val Val Glu Met Gly Met Asp Gly Pro Gly Gin He Ser Ala 

160 165 170 

ttg tec aaa etc ttg cag cct gac att gee att ate acc atg att ggc 579 
Leu Ser Lys Leu Leu Gin Pro Asp He Ala He He Thr Met He Gly 
175 180 185 

gag gee cac ate gag ttc ttt ggg tea agg gac aaa att gee cag gec 627 
Glu Ala His He Glu Phe Phe Gly Ser Arg Asp Lys He Ala Gin Ala 
190 195 200 

aaa ctg gaa att eta gat ggc eta age gac cag ggc gtc ttt att gee 675 
Lys Leu Glu He Leu Asp Gly Leu Ser Asp Gin Gly Val Phe He Ala 
205 210 215 220 



723 



age ctg cgt ttt ggc caa teg ccc cac aat gac att tat cct ttg acc 771 
Ser Leu Arg Phe Gly Gin Ser Pro His Asn Asp He Tyr Pro Leu Thr 

240 245 250 
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act gag att gga cag egg caa age cag ttc acc ctt aac ctg gac cct 
Thr Glu He Gly Gin Arg Gin Ser Gin Phe Thr Leu Asn Leu Asp Pro 
255 260 265 

agt ctg caa ttt acc ate cct tea cca gga aaa tat aat gtc att aac 
Ser Leu Gin Phe Thr He Pro Ser Pro Gly Lys Tyr Asn Val He Asn 
270 275 280 

gee eta get gca gtc ttg gta gee cag gtc ctg gac ttg gac etc caa 
Ala Leu Ala Ala Val Leu Val Ala Gin Val Leu Asp Leu Asp Leu Gin 
285 290 295 300 

eta get gtc cag ggc ttg gee cag ttt cag eta age aaa aac egg ctg 
Leu Ala Val Gin Gly Leu Ala Gin Phe Gin Leu Ser Lys Asn Arg Leu 

305 310 315 

gaa tgg eta aaa ggc tat aag cag gee cac tta tta aat gat 'get tac 
Glu Trp Leu Lys Gly Tyr Lys Gin Ala His Leu Leu Asn Asp Ala Tyr 

320 325 330 

aat get agt ccc act tec atg aag gcg gtc ttg gat tat ttc age cat 
Asn Ala Ser Pro Thr Ser Met Lys Ala Val Leu Asp Tyr Phe Ser His 
335 340 345 



819 



gac ccc aaa ctt tta gac egg gtt gtc tta tat gga cca gaa atg gca 
Asp Pro Lys Leu Leu Asp Arg Val Val Leu Tyr Gly Pro Glu Met Ala 

385 390 395 



tat ttc cca gag gat cga aaa gee ttg acc gac ttt tta aaa gaa ate 
Tyr Phe Pro Glu Asp Arg Lys Ala Leu Thr Asp Phe Leu Lys Glu He 
415 420 425 

atg ggc cca tct tct tat ctt ttg ttg aag tec agt eta gga aca ggt 
Met Gly Pro Ser Ser Tyr Leu Leu Leu Lys Ser Ser Leu Gly Thr Gly 
430 435 440 

ctg ctt gaa gtg gtc caa gee eta agt caa aaa gaa gat gat gaa aac 
Leu Leu Glu Val Val Gin Ala Leu Ser Gin Lys Glu Asp Asp Glu Asn 
445 450 455 460 

cag ccc ctg gac taa 
Gin Pro Leu Asp 



867 



915 



963 



1011 



1059 



ttg gac eta gat ggg gag aag ata gcg gtt tta ggg gac ttg egg gag 1107 
Leu Asp Leu Asp Gly Glu Lys lie. Ala Val Leu Gly Asp Leu Arg Glu 
350 355 360 

tta ggg tct ttg tec ggt caa etc cac egg tea ctt agt caa gee ate 1155 
Leu Gly Ser Leu Ser Gly Gin Leu His Arg Ser Leu Ser Gin Ala He 
365 370 375 380 



1203 



gee etc tac cag gtc ttg aag get gat ttt gat cct gac cac ttg act 1251 
Ala Leu Tyr Gin Val Leu Lys Ala Asp Phe Asp Pro Asp His Leu Thr 

400 405 410 



1299 



1347 



1395 



1410 
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<210> 4 
<211> 464 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 4 

Met Ser Trp Lys Leu Lys Glu lie Ala Gin Ala Val Gly Gly Glu Leu 
15 10 15 



Val Ser Ala Asp Gly Gin Glu Glu Val Thr Gly Val His Phe Asp Ser 

20 25 30 



Arg Arg Leu Glu Pro Gly Asp Leu Phe Val Pro lie Leu Gly Gin Arg 
35 40 45 



Asp Gly His Asp Phe Ala Gin Ala Ala Leu Asp Gin Gly Ala Ser Gly 
50 55 60 



Ala Phe Trp Ala Lys Asp Ser Ser Leu Ala Pro Lys Gly Leu Pro Leu 
65 70 75 80 

lie Lys Val Glu Asp Ser Tyr Gin Ala Leu Val Asp Leu Ala Lys Trp 

85 90 95 



His Leu Glu Ala Val Ala Pro Met Lys lie Ala lie Thr Gly Ser Asn 

100 105 110 



Gly Lys Thr Thr Thr Lys Asp Met Val Ala Ser Val Val Gly Gin Ala 
115 120 125 

Phe Lys Cys His Lys Thr Val Ser Asn Leu Asn Asn Glu Leu Gly Val 
130 135 140 

Pro Met Thr He Leu Ala Met Pro Ala Asp Cys Gin Val He Val Val 
145 150 155 160 



Glu Met Gly Met Asp Gly Pro Gly Gin He Ser Ala Leu Ser Lys Leu 

165 170 175 

Leu Gin Pro Asp He Ala He He Thr Met He Gly Glu Ala His. He 

180 185 190 

Glu Phe Phe Gly Ser Arg Asp Lys He Ala Gin Ala Lys Leu Glu He 
195 200 205 
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Leu Asp Gly Leu Ser Asp Gin Gly Val Phe He Ala Asn Gly Asp Glu 
210 215 220 

Pro Leu Leu Glu Ser Ala Leu Asn His His Pro His Ser Leu Arg Phe 
225 230 235 240 

Gly Gin Ser Pro His Asn Asp He Tyr Pro Leu Thr Thr Glu He Gly 

245 250 255 

Gin Arg Gin Ser Gin Phe Thr Leu Asn Leu Asp Pro Ser Leu Gin Phe 

260 265 270 

Thr He Pro Ser Pro Gly Lys Tyr Asn Val He Asn Ala Leu Ala Ala 
275 280 285 

Val Leu Val Ala Gin Val Leu Asp Leu Asp Leu Gin Leu Ala Val Gin 
290 295 300 

Gly Leu Ala Gin. Phe Gin Leu Ser Lys Asn Arg Leu Glu Trp Leu Lys 
305 310 315 320 

Glv Tvr Lys Gin Ala His Leu Leu Asn Asp Ala Tyr Asn Ala Ser Pro 

325 330 335 



Thr Ser Met Lys Ala Val Leu Asp Tyr Phe Ser His Leu Asp Leu Asp 

340 345 350 

Gly Glu Lys He Ala Val Leu Gly Asp Leu Arg Glu Leu Gly Ser Leu 
355 360 365 

Ser Gly Gin Leu His Arg Ser Leu Ser Gin Ala He Asp Pro Lys Leu 
370 375 380 

Leu Asp Arg Val Val Leu Tyr Gly Pro Glu Met Ala Ala Leu Tyr Gin 
385 390 395 400 

Val Leu Lys Ala Asp Phe Asp Pro Asp His Leu Thr Tyr Phe Pro Glu 

405 410 415 

Asp Arg Lys Ala Leu Thr Asp Phe Leu Lys Glu He Met Gly Pro Ser 

420 425 430 
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Ser Tyr Leu Leu Leu Lys Ser Ser Leu Gly Thr Gly Leu Leu Glu Val 
435 440 445 



Val Gin Ala Leu Ser Gin Lys Glu Asp Asp Glu Asn Gin Pro Leu Asp 
450 455 460 



<210> 5 
<211> 1284 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (1284) 

<223> 

<400> 5 

gattgg atg aat ata atg aaa aaa eta ate ate aac ggt ggc egg ace 48 
Met Asn He Met Lys Lys Leu lie lie Asn Gly Gly Arg Thr 
15 10 

etc aag ggt gaa gtc acg gta tea ggg gee aaa aat agt acg gtg get 9 6 

Leu Lys Gly Glu Val Thr Val Ser Gly Ala Lys Asn Ser Thr Val Ala 
15 20 25 30 

etc att eca gca tct att tta gca gac age ccg gta ate eta gag ggg 144 
Leu He Pro Ala Ser He Leu Ala Asp Ser Pro Val He Leu Glu Gly 

35 40 45 

gta ccc gat ate cag gat gtt cat tec eta ctg gag att tta aat gaa 192 
Val Pro Asp He Gin Asp Val His Ser Leu Leu Glu He Leu Asn Glu 

50 55 60 

atg aat gtc aag ace gac ttt gac gga aac act ttg ace att gac cca 240 
Met Asn Val Lys Thr Asp Phe Asp Gly Asn Thr Leu Thr He Asp Pro 
65 70 75 

aga gaa atg gtc tct ate ccc atg cca agt ggt aag ate caa age ttg 288 
Arg Glu Met Val Ser He Pro Met Pro Ser Gly Lys He Gin Ser Leu 
80 85 90 

egg get tec tac tac ttt atg gga gee etc ttg gee aaa ttc ggt aaa 33 6 

Arg Ala Ser Tyr Tyr Phe Met Gly Ala Leu Leu Ala Lys Phe Gly Lys 
95 " 100 105 110 

ggg gta gtc ggt ctt ccc ggt ggt tgc ttc ctg ggg cca cga ccc ate 384 
Gly Val Val Gly Leu Pro Gly Gly Cys Phe Leu Gly Pro Arg Pro He 

115 120 125 

gac caa cac ttg aaa ggc ttc cgc ctg ctt gga gca gat gtg gat aat 432 
Asp Gin His Leu Lys Gly Phe Arg Leu Leu Gly Ala Asp Val Asp Asn 

130 135 140 
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gaa atg ggg gcc atg tac ctt aaa acc agt gat tea ggc eta gtg ggt 
Glu Met Gly Ala Met Tyr Leu Lys Thr Ser Asp Ser Gly Leu Val Gly 
145 150 155 

agt egg att tac tta gat gtt gtt teg att ggt gca acc att aat ate 
Ser Arg lie Tyr Leu Asp Val Val Ser He Gly Ala Thr He Asn He 
160 165 170 

atg tta gcc get gtt agg gcc caa ggt egg acg gtc att gag aat gcg 
Met Leu Ala Ala Val Arg Ala Gin Gly Arg Thr Val He Glu Asn Ala 
175 180 185 190 

gcc cga gaa cca gaa att att gat gtt gcc acc etc ttg aac aag atg 
Ala Arg Glu Pro Glu He He Asp Val Ala Thr Leu Leu Asn Lys Met 

195 200 205 



480 



gtt gac cag ctg act ggc tgc cag cac tec ate ate ccc gac egg att 
Val Asp Gin Leu Thr Gly Cys Gin His Ser He He Pro Asp Arg He 
225 230 235 

gaa get ggg acc tac ctg get att gca gcg gca get ggg gag gat gtc 
Glu Ala Gly Thr Tyr Leu Ala He Ala Ala Ala Ala Gly Glu Asp Val 
240 245 250 

ctg gta aac aat gtt ata gtt gaa cat att gat agt tta att gcc aaa 
Leu Val Asn Asn Val He Val Glu His He Asp Ser Leu He Ala Lys 
255 260 265 270 

etc gac gaa att ggt att gac ctg gac ate ggc gaa gac agt ate egg 
Leu Asp Glu He Gly lie Asp Leu Asp He Gly Glu Asp Ser He Arg 

275 280 285 

gtg aaa gcc ccc agt aaa cct ttg cag cct gtt acc ate aaa acc ctg 
Val Lys Ala Pro Ser Lys Pro Leu Gin Pro Val Thr He Lys Thr Leu 

290 295 300 

cct tac cct ggt ttt gcc act gac etc cag cag ccc ate acc cct etc 
Pro Tyr Pro Gly Phe Ala Thr Asp Leu Gin Gin Pro He Thr Pro Leu 
305 310 315 

ttg ctt ctg gcc aaa ggg gag tec gtt ate acc gat acc ate tat cct 
Leu Leu Leu Ala Lys Gly Glu Ser Val He Thr Asp Thr He Tyr Pro 
320 325 330 

aaa egg gtt aag cac ate cct gag ctg gaa egg atg ggg gcc aat ate 
Lys Arg Val Lys His He Pro Glu Leu Glu Arg Met Gly Ala Asn He 
335 340 345 350 

egg gtc gaa age gat ate ate etc att gaa ggt ggc cac ccc etc aag 
Arg Val Glu Ser Asp He He Leu He Glu Gly Gly His Pro Leu Lys 

355 360 365 



528 



576 



624 



ggg get aaa ata cgt ggg get ggc act gat atg ate egg att gaa ggg 672 
Gly Ala Lys He Arg Gly Ala Gly Thr Asp Met He Arg He Glu Gly 

210 215 220 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 
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ggg gca gaa gtg gaa gcc agt gat tta aga gcc ggg get tgc ttg att 
Gly Ala Glu Val Glu Ala Ser Asp Leu Arg Ala Gly Ala Cys Leu lie 

370 375 380 

aat gca ggt ttg ate gcg gaa ggt cag acg gaa att act ggc gtt gac 
Asn Ala Gly Leu He Ala Glu Gly Gin Thr Glu He Thr Gly Val Asp 
385 390 395 

aaa att eta aga ggc tac tct cat att gtt gaa aaa etc aat gac eta 
Lys He Leu Arg Gly Tyr Ser His He Val Glu Lys Leu Asn Asp Leu 
400 405 410 

ggc gca gat gtt tat atg caa gag ggg gaa gac tga 
Gly Ala Asp Val Tyr Met Gin Glu Gly Glu Asp 
415 420 425 



<210> 6. 
<211> 425 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 6 

Met Asn He Met Lys Lys Leu He He Asn Gly Gly Arg Thr Leu Lys 
1 5 10 15 

Gly Glu Val Thr Val Ser Gly Ala Lys Asn Ser Thr Val Ala Leu He 

20 25 30 



Pro Ala Ser He Leu Ala Asp Ser Pro Val He Leu Glu Gly Val Pro 
35 40 45 



Asp He Gin Asp Val His Ser Leu Leu Glu He Leu Asn Glu Met Asn 
50 55 60 

Val Lys Thr Asp Phe Asp Gly Asn Thr Leu Thr He Asp Pro Arg Glu 
65 ~ 70 75 80 

Met Val Ser He Pro Met Pro Ser Gly Lys He Gin Ser Leu Arg Ala 

85 90 95 

Ser Tyr Tyr Phe Met Gly Ala Leu Leu Ala Lys Phe Gly Lys Gly Val 

100 105 HO 

Val Gly Leu Pro Gly Gly Cys Phe Leu Gly Pro Arg Pro He Asp Gin 
115 120 125 



1152 



1200 



1248 



1284 



His Leu Lys Gly Phe Arg Leu Leu Gly Ala Asp Val Asp Asn Glu Met 
. 130 " 135 140 
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Gly Ala Met Tyr Leu Lys Thr Ser Asp Ser Gly Leu Val Gly Ser Arg 
145 150 155 160 

lie Tyr Leu Asp Val Val Ser lie Gly Ala Thr lie Asn He Met Leu 

165 170 175 

Ala Ala Val Arg Ala Gin Gly Arg Thr Val He Glu Asn Ala Ala Arg 

180 185 190 

Glu Pro Glu He He Asp Val Ala Thr Leu Leu Asn Lys Met Gly Ala 
195 200 205 

Lys He Arg Gly Ala Gly Thr Asp Met He Arg He Glu Gly Val Asp 
210 215 220 

Gin Leu Thr Gly Cys Gin His Ser He He Pro Asp Arg He Glu Ala 
225 230 235 240 

Gly Thr Tyr Leu Ala He Ala Ala Ala Ala Gly Glu Asp Val Leu Val 

245 250 255 



Asn Asn Val He Val Glu His He Asp Ser Leu He Ala Lys Leu Asp 

260 265 270 

Glu He Gly He Asp Leu Asp He Gly Glu Asp Ser He Arg Val Lys 
275 280 285 

Ala Pro Ser Lys Pro Leu Gin Pro Val Thr He Lys Thr Leu Pro Tyr 
290 295 300 

Pro Gly Phe Ala Thr Asp Leu Gin Gin Pro He Thr Pro Leu Leu Leu 
305 310 315 320 

Leu Ala Lys Gly Glu Ser Val He Thr Asp Thr He Tyr Pro Lys Arg 

325 330 335 

Val Lys His He Pro Glu Leu Glu Arg Met Gly Ala Asn He Arg Val 

340 345 350 

Glu Ser Asp He He Leu He Glu Gly Gly His Pro Leu Lys Gly Ala 
355 360 365 
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Glu Val Glu Ala Ser Asp Leu Arg Ala Gly Ala Cys Leu He Asn Ala 
370 375 380 



Gly Leu He Ala Glu Gly Gin Thr Glu He Thr Gly Val Asp Lys He 
385 390 395 400 



Leu Arg Gly Tyr Ser His He Val Glu Lys Leu Asn Asp Leu Gly Ala 

405 410 415 



Asp Val Tyr Met Gin Glu Gly Glu Asp 

420 425 



<210> 


7 










<211> 


612 










<212> 


DMA 










<213> 


Alloiococcus otitidis 










<220> 












<221> 


CDS 










<222> 


(4) . . (612) 










<223> 












<400> 


7 










ctt ttg cat aga caa gac ttg 


aat 


cgt 


gaa 


agg 


Met His Arg Gin Asp Leu 


Asn 


Arg 


Glu 


Arg 


1 


5 








10 



k ag tea gat gtg gaa 48 
,ys Ser Asp Val Glu 

15 

tta aaa gag ttt gat gga aag aaa aaa gaa gaa eta gcc atg att gat 96 
Leu Lys Glu Phe Asp Gly Lys Lys Lys Glu Glu Leu Ala Met He Asp 

20 25 30 

gtg gcc aag gcc att tta gac cag gtc cat gac ttg atg cac ttc aac 144 
Val Ala Lys Ala He Leu Asp Gin Val His Asp Leu Met His Phe Asn 

35 40 45 

gac etc ttg agt gaa gtg tct gaa tat eta -gac ttg tea gat gac gag 192 
Asp Leu Leu Ser Glu Val Ser Glu Tyr Leu Asp Leu Ser Asp Asp Glu 
50 55 60 

ate gaa age ggt atg ggc caa ttt tac acc gat tta aat att gac ggt 240 
He Glu Ser Gly Met Gly Gin Phe Tyr Thr Asp Leu Asn He Asp Gly 
65 70 75 

cgc ttc ate tct tta ggc gac aac cat tgg ggc tta cgt gaa tgg tat 288 
Arg Phe He Ser Leu Gly Asp Asn His Trp Gly Leu Arg Glu Trp Tyr 
80 85 90 95 

cca gtc gat tct ate gat gaa gag ttg acc cac gac aat gac ctg gag 33 6 

Pro val Asp Ser He Asp Glu Glu Leu Thr His Asp Asn Asp Leu Glu 

100 105 HO 



aag gtc aca ccc aag cag gcg gaa gac ggc ttt gat gac tta gag cat 



384 
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Lys Val Thr Pro Lys Gin Ala Glu Asp Gly Phe Asp Asp Leu Glu His 

115 ~ 120 125 

gtc gaa aaa gaa gtg atg gat gac gca aaa gaa gaa tta gat gac cag 
Val Glu Lys Glu Val Met Asp Asp Ala Lys Glu Glu Leu Asp Asp Gin 
130 135 140 

gcc gtc aat gaa gat gaa gaa aat gtt get cca gat gaa ate ace gac 
Ala Val Asn Glu Asp Glu Glu Asn Val Ala Pro Asp Glu lie Thr Asp 
145 150 155 

gat gga gat gaa gac aag ctg gat gaa tac tct age gat ate gaa gac 
Asp Gly Asp Glu Asp Lys Leu Asp Glu Tyr Ser Ser Asp lie Glu Asp 
160 "* 165 170 175 

etc gaa gat gat cgt aag get age caa gac aag ctg tec att gtt gac 
Leu Glu Asp Asp Arg Lys Ala Ser Gin Asp Lys Leu Ser lie Val Asp 

180 185 190 

gac gaa gat gtc tta aca aat gat gac gat gag taa 
Asp Glu Asp Val Leu Thr Asn Asp Asp Asp Glu 

195 200 



<210> 8 
<211> 202 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 8 

Met His Arg Gin Asp Leu Asn Arg Glu Arg Lys Ser Asp Val Glu Leu 
1 5 10 15 



Lys Glu Phe Asp Gly Lys Lys Lys Glu Glu Leu Ala Met lie Asp Val 

20 25 30 



Ala Lys Ala lie Leu Asp Gin Val His Asp Leu Met His Phe Asn Asp 
35 40 45 



Leu Leu Ser Glu Val Ser Glu Tyr Leu Asp Leu Ser Asp Asp Glu lie 
50 55 60 



Glu Ser Gly Met Gly Gin Phe Tyr Thr Asp Leu Asn lie Asp Gly Arg 
65 " 70 75 80 



Phe He Ser Leu Gly Asp Asn His Trp Gly Leu Arg Glu Trp Tyr Pro 

85 90 95 



Val Asp Ser lie Asp Glu Glu Leu Thr His Asp Asn Asp Leu Glu Lys 

100 105 110 



432 



480 



528 



576 



612 
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Val 


TPViT" 


Pro 
115 




Gin 

VfJ> jl 


Ala 


Glu 


Asp 
120 


Gly 


Phe 


Asp 


Asp 


Leu 
125 


Glu 


His 


Val 


VJ7 _L U 


T,v<? 

130 


Glu 


Val 


Met 


Aso 


Asp 
135 


Ala 


Lys 


Glu 


Glu 


Leu 
140 


Asp Asp 


Gin 


Ala 


Val 

V U 

145 




Glu 


ASTD 


Glu 


Glu 
150 


Asn 


Val 


Ala 


Pro 


Asp 
155 


Glu 


He 


Thr 


Asp 


Asp 
160 


Gly 


Asp 


Glu 


Asp 


Lys 
165 


Leu 


Asp 


Glu 


Tyr 


Ser 
170 


Ser 


Asp 


He 


Glu 


Asp 
175 


Leu 


Glu 


Asp 


Asp 


Arg 
180 


Lys 


Ala 


Ser 


Gin 


Asp 
185 


Lys 


Leu 


Ser 


He Val 
190 


Asp 


Asp 


Glu 


Asp 


Val 
195 


Leu 


Thr 


Asn 


Asp 


Asp 
200 


Asp 


Glu 














<210> 9 
<211> 942 
<212> DNA 

<213> Alloiococcus otitidis 




















<220> 
<221> CDS 
<222> (1) . . 
<223> 


(942) 
























<400> 9 
atg ate 
Met lie 
1 


gaa 
Glu 


att 
lie 


gaa 
Glu 
5 


aag 
Lys 


cca 
Pro 


gta 
Val 


att 
He 


gaa 
Glu 
10 


aca 
Thr 


gta 
Val 


gag 
Glu 


ate 
He 


agt 
Ser 
15 


gaa 
Glu 


gat 
Asp 


ggc 

Gly 


aaa 
Lys 


ttc 
Phe 
20 


ggt 
Gly 


aag 
Lys 


ttt 
Phe 


gtt 
Val 


gtt 
val 
25 


gaa 
Glu 


cca 
Pro 


ttg 
Leu 


gaa 
Glu 


cgt 
Arg 
30 


ggt tat 
Gly Tyr 


ggg 

Gly 


act 
Thr 


acc 
Thr 
35 


tta 
Leu 


ggg 

Gly 


aat 
Asn 


tec 
Ser 


tta 
Leu 
40 


cgc 
Arg 


cgc 
Arg 


ate 
He 


tta 
Leu 


tta 
Leu 
45 


tea 
Ser 


tea 
Ser 


eta 
Leu 


ccg 
Pro 


ggt 

Gly 
50 


get 
Ala 


gcg 
Ala 


gtc 
Val 


acc 
Thr 


aat 
Asn 
55 


att 
He 


caa 
Gin 


att 
He 


gat 
Asp 


ggt gtt 
Gly Val 
60 


ttg 
Leu 


cat 
His 


gag 
Glu 


ttt 
Phe 
65 


aca 
Thr 


get 
Ala 


att 
lie 


gat 
Asp 


ggt 

Gly 
70 


gtg 

Val 


gtt 
val 


gaa 
Glu 


gat 

Asp 


gtg 

Val 
75 


act 
Thr 


tec 
Ser 


ate 
He 


ate 
He 


tta 
Leu 
80 


aac 


ctg 


aaa 


aaa 


ctg 


get 


tta 


aaa 


ctt 


cat 


act 


gaa 


gaa 


aca 


aaa 


aca 



48 



96 



144 



192 



240 



288 
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Asn Leu Lys Lys Leu Ala Leu Lys Leu His Thr Glu Glu Thr Lys Thr 

85 90 95 

att gaa ttg gat att gaa ggc cct get gaa gtg aca gca get gat att 33 6 

lie Glu Leu Asp He Glu Gly Pro Ala Glu Val Thr Ala Ala Asp He 

100 105 110 

att act gat agt gat gtt gag att atg aat cca gac eta tac ttg tgt 384 

He Thr Asp Ser Asp Val Glu He Met Asn Pro Asp Leu- Tyr Leu Cys 

115 120 125 

act gtt tct gaa ggt ggt cat tta cac ate egg atg gaa gca gaa act 432 

Thr Val Ser Glu Gly Gly His Leu His He Arg Met Glu Ala Glu Thr 

130 135 140 

ggt aga ggt tat gtg aat gca gag cac aac aag cat gat gat atg cca 48 0 

Gly Arg Gly Tyr Val Asn Ala Glu His Asn Lys His Asp Asp Met Pro 

145 150 * 155 160 

ate ggt gtt ttg cca att gat tea att tat acc cca att age cgt gtc 528 

lie Gly Val Leu Pro He Asp Ser He Tyr Thr Pro He Ser Arg Val 

165 170 175 

aac tat act gtt gaa gac acc cgc gtt ggt gaa cgc gag caa tat gat 57 6 

Asn Tyr Thr Val Glu Asp Thr Arg Val Gly Glu Arg Glu Gin Tyr Asp 

180 185 190 

aag tta acc ctg gat att tgg aca gat gga tec ate tec cca gag gat 624 

Lys Leu Thr Leu Asp He Trp Thr Asp Gly Ser He Ser Pro Glu Asp 

195 200 205 

ggc ttg agt eta gcg get aag ate atg aat gaa cac ttg aac ate ttc 672 

Gly Leu Ser Leu Ala Ala Lys He Met Asn Glu His Leu Asn He Phe 

210 215 220 

ate aac tta act gag caa gca cgt gaa gcg gac att atg gtt gaa aaa 720 

He Asn Leu Thr Glu Gin Ala Arg Glu Ala Asp He Met Val Glu Lys 

225 230 235 240 

gaa gaa gac cag aaa gaa aaa atg ctt gag atg acc ate gaa gag ctt 768 

Glu Glu Asp Gin Lys Glu Lys Met Leu Glu Met .Thr He Glu Glu Leu 

245 ' 250 255 

gat tta tct gtt egg tct tac aac tgt ttg aaa cgt get ggc ate aat 816 

Asp Leu Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn 

260 265 270 

act gtc caa gaa eta acg gac aaa act gaa ccg gaa atg atg aaa gtt 864 

Thr Val Gin Glu Leu Thr Asp Lys Thr Glu Pro Glu Met Met Lys Val 

275 280 285 

cgc aat etc gga cgt aag tea tta gaa gaa gtt aaa aac aag ctt gat 912 

Arg Asn Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Asn Lys Leu Asp 

290 295 300 



gac tta gac eta age ttg aaa gaa gaa tag 
Asp Leu Asp Leu Ser Leu Lys Glu Glu 



942 
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305 310 



<210> 10 
<211> 313 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 10 

Met lie Glu lie Glu Lys Pro Val lie Glu Thr Val Glu lie Ser Glu 
15 10 15 



Asp Gly Lys Phe Gly Lys Phe Val Val Glu Pro Leu Glu Arg Gly Tyr 

20 25 30 



Gly Thr Thr Leu Gly Asn Ser Leu Arg Arg lie Leu Leu Ser Ser Leu 
35 40 45 



Pro Gly Ala Ala Val Thr Asn lie Gin lie Asp Gly Val Leu His Glu 
50 55 60 



Phe Thr Ala lie Asp Gly Val Val Glu Asp Val Thr Ser lie lie Leu 
65 70 75 80 



Asn Leu Lys Lys Leu Ala Leu Lys Leu His Thr Glu Glu Thr Lys Thr 

85 90 95 



He Glu Leu Asp He Glu Gly Pro Ala Glu Val Thr Ala Ala Asp He 

100 105 110 



He Thr Asp Ser Asp Val Glu He Met Asn Pro Asp Leu Tyr Leu Cys ' 
115 120 125 



Thr Val Ser Glu Gly Gly His Leu His He Arg Met Glu Ala Glu Thr 
130 135 140 



Gly Arg Gly Tyr Val Asn Ala Glu His Asn Lys His Asp Asp Met Pro 
145 "* " 150 155 160 



He Gly Val Leu Pro He Asp Ser He Tyr Thr Pro He Ser Arg Val 

165 170 175 



Asn Tyr Thr Val Glu Asp Thr Arg Val Gly Glu Arg Glu Gin Tyr Asp 

180 185 190 
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Lys Leu Thr Leu Asp lie Trp Thr Asp Gly Ser He Ser Pro Glu Asp 
195 200 205 



Gly Leu Ser Leu Ala Ala Lys He Met Asn Glu His Leu Asn He Phe 
210 215 220 



He Asn Leu Thr Glu Gin Ala Arg Glu Ala Asp He Met Val Glu Lys 
225 230 235 240 



Glu Glu Asp Gin Lys Glu Lys Met Leu Glu Met Thr He Glu Glu Leu 

245 250 255 



Asp Leu Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn 

260 265 270 



Thr Val Gin Glu Leu Thr Asp Lys Thr Glu Pro Glu Met Met Lys Val 
275 280 285 



Arg Asn Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Asn Lys Leu Asp 
290 295 300 



Asp Leu Asp Leu Ser Leu Lys Glu Glu 
305 310 



<210> 11 
<211> 3681 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (3681) 

<223> 

<400> 11 

aataaaggga ggtttgcccc c ttg gta gat gta aat aat ttt gaa agt att 51 

Met Val Asp Val Asn Asn Phe Glu Ser He 
15 10 

caa att gga ctg get tea cca gag aaa ate cgt tea tgg tct cat ggt 99 
Gin He Gly Leu Ala Ser Pro Glu Lys He Arg Ser Trp Ser His Gly 

15 20 25 

gaa gtg aag aaa cct gaa acc att aac tac egg aca tta aaa cct gaa 147 
Glu Val Lys Lys Pro Glu Thr He Asn Tyr Arg Thr Leu Lys Pro Glu 

30 35 40 



aaa gac ggt ttg ttc tgc gaa cgc att ttt ggc cca acc aag gac tat 195 
Lys Asp Gly Leu Phe Cys Glu Arg He Phe Gly Pro Thr Lys Asp Tyr 
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45 50 55 

gaa tgt get tgc gga aaa tat aaa cga gtc cac tat aaa ggg ata gtt 243 
Glu Cys Ala Cys Gly Lys Tyr Lys Arg Val His Tyr Lys Gly lie Val 
60 65 70 



tgt gac cgt tgc ggt gtt gaa gtc acc aag teg agt gtc aga cga gaa 
Cys Asp Arg Cys Gly Val Glu Val Thr Lys Ser Ser Val Arg Arg Glu 
75 " ~ 80 85 90 



egg acc egg get att cgt cgt tta gac att att gac tec ttc aag tct 
Arg Thr Arg Ala lie Arg Arg Leu Asp He He Asp Ser Phe Lys Ser 
205 210 215 



acc 



age gac ttg aac gac ttg tac cgc egg gtg att aac egg aac aac 
Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val lie Asn Arg Asn Asn 

255 260 265 



291 



cgc atg ggc cac ttg gaa tta gca get cct gtc acc cac att tgg tac 339 
Arg Met Gly His Leu Glu Leu Ala Ala Pro Val Thr His He Trp Tyr 

95 100 105 

ttc aag ggt att cca agt egg atg ggc ctt ate tta gat atg age cca 387 
Phe Lys Gly He Pro Ser Arg Met Gly Leu He Leu Asp Met Ser Pro 

110 115 120 

aga tec ttg gaa gaa att ate tat ttt gee tct tat gtt gtt att gac 435 
Arg Ser Leu Glu Glu He He Tyr Phe Ala Ser Tyr Val Val He Asp 
125 130 135 

ggt ggg gat acc ccg ctt gaa cgc aaa cag etc tta act gaa cgt gaa 483 
Gly Gly Asp Thr . Pro Leu Glu Arg Lys Gin Leu Leu Thr Glu Arg Glu 
140 145 150 

tac egg gaa aac aaa age aag tac ggc aat gaa ttc caa get gaa att 531 
Tyr Arg Glu Asn Lys Ser Lys Tyr Gly Asn Glu Phe Gin Ala Glu He 
155 160 165 170 

gga get gaa get gtt egg acc ttg eta aaa aat gtc gat ttg gaa caa 579 
Gly Ala Glu Ala Val Arg Thr Leu Leu Lys Asn Val Asp Leu Glu Gin 

175 180 185 

gaa gtt get gac etc aaa gaa ate tta gaa act gca act ggc caa aaa 627 
Glu Val Ala Asp Leu Lys Glu He Leu Glu Thr Ala Thr Gly Gin Lys 

190 195 200 



675 



tec aac aac aaa ccg gaa tgg atg gtc ttg gat get att cca att ate 723 

Ser Asn Asn Lys Pro Glu Trp Met Val Leu Asp Ala He Pro He He 
220 "* 225 230 

cca cct gaa etc cgc cca atg gta caa eta gaa ggt ggc egg ttt gca 771 

Pro Pro Glu Leu Arg Pro Met Val Gin Leu Glu Gly Gly Arg Phe Ala 
235 240 245 250 



819 



egg ttg aaa cgc ttg ctt gac ttg aat gee ccc cac att ate gtc caa 867 
Arg Leu Lys Arg Leu Leu Asp Leu Asn Ala Pro His He He Val Gin 

270 275 280 
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aat gaa aaa egg atg ctg caa gaa get gtt gac gec ttg att gac aat 915 
Asn Glu Lys Arg Met Leu Gin Glu Ala Val Asp Ala Leu lie Asp Asn 
285 290 295 



ggt cgt cgc ggt egg gca gtc aac ggt cct ggt aac cgt ccg ctt aaa 
Gly Arg Arg Gly Arg Ala Val Asn Gly Pro Gly Asn Arg Pro Leu Lys 
300 305 310 



eta eta ggg aaa egg gtt gac tac tct ggc egg tct gtc att gtt gtt 
Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg Ser Val lie Val Val 

335 340 345 



963 



tct ctt tct cac atg ttg aaa ggg aaa caa ggg cgc ttc cgt cag aac 1011 
Ser Leu Ser His Met Leu Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn 
315 320 325 330 



1059 



ggg cca acc ctt aaa atg tac caa tgt ggt eta ccg aaa gaa atg" gee 1107 
Gly Pro Thr Leu Lys Met Tyr Gin Cys Gly Leu Pro Lys Glu Met Ala 

350 355 360 

ate gaa etc ttc aaa cct ttt gtc atg egg gag eta gtt gag cga gat 1155 
He Glu Leu Phe Lys Pro Phe Val Met Arg Glu Leu Val Glu Arg Asp 
365 370 375 

att gca aat aac att aaa aat gec aaa cga aaa gtg gaa egg atg gaa 1203 
He Ala Asn Asn He Lys Asn Ala Lys Arg Lys Val Glu Arg Met Glu 
380 385 390 

gat gat gtc tgg cct gtt tta gaa gat gtc att aaa gaa cac cct gtc 1251 
Asp Asp Val Trp Pro Val Leu Glu Asp Val He Lys Glu His Pro Val 
395 400 405 410 

etc tta aac egg gec cct acc ctt cac egg eta ggg ate caa gee ttt 1299 
Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu Gly He Gin Ala Phe 

415 420 425 

gaa cct gtc ctt gtc aat ggg aag get att cgc tta cac cca etc get 1347 
Glu Pro Val Leu Val Asn Gly Lys Ala He Arg Leu His Pro Leu Ala 

430 435 440 

tgt gaa gec tac aat get gac ttt gac gga gac caa atg get gtc cac 13 95 

Cys Glu Ala Tyr Asn Ala Asp Phe Asp Gly Asp Gin Met Ala Val His 
445 450 455 

gta ccc etc agt gat gaa gee cag gca gaa gee cgc ate tta atg ctg 1443 
Val Pro Leu Ser Asp Glu Ala Gin Ala Glu Ala Arg He Leu Met Leu 
460 465 470 

ggt gee caa aat ate tta aac cct aaa gat ggt caa cca gtc gtt acc 1491 
Gly Ala Gin Asn He Leu Asn Pro Lys Asp Gly Gin Pro Val Val Thr 
475 480 485 490 

cct tec caa gac atg gtc eta ggg aac tac tac eta acc atg gaa gaa 1539 
Pro Ser Gin Asp Met Val Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu 

495 500 505 
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gaa ggt aaa att ggt gaa gga act gtc ttc tec agt get tct gag get 1587 
Glu Gly Lys He Gly Glu Gly Thr Val Phe Ser Ser Ala Ser Glu Ala 

510 ~ 515 520 

ate caa gee tac caa aca ggc tat gtc cac etc cac ace egg gtt gcg 163 5 

He Gin Ala Tyr Gin Thr Gly Tyr Val His Leu His Thr Arg Val Ala 
525 530 535 



ate cgt gcg gtg gac tta ccg gac aaa cct ttt act gac tgg cag aaa 
He Arg Ala Val Asp Leu Pro Asp Lys Pro Phe Thr Asp Trp Gin Lys 
540 545 550 



gaa cag caa ace cca gac aag tac ttt gtc gac egg ggc caa aac ttg 
Glu Gin Gin Thr Pro Asp Lys Tyr Phe Val Asp Arg Gly Gin Asn Leu 

590 595 600 

aaa gac ctt att gee gac cgt cct tta gtt cag cct ttc aaa aaa caa 
Lys Asp Leu He Ala Asp Arg Pro Leu Val Gin Pro Phe Lys Lys Gin 
605 610 615 

gac ctg tec aac att ate gee gaa gtc ttt aat aac ttc caa gtg acc 
Asp Leu Ser Asn He He Ala Glu Val Phe Asn Asn Phe Gin Val Thr 
620 625 630 



tct acc egg tct ggt att act gtt ggg att get gac gtt tea gtc eta 
Ser Thr Arg Ser Gly He Thr Val Gly He Ala Asp Val Ser Val Leu 

655 660 665 

gaa get aaa cca gaa ate ctg aaa gaa gec cac gee aag gtt gat aaa 
Glu Ala Lys Pro Glu He Leu Lys Glu Ala His Ala Lys Val Asp Lys 

670 675 680 

ate aat gec acc cac cgc cgc ggt tta att act gaa gaa gag cgt tac 
He Asn Ala Thr His Arg Arg Gly Leu He Thr Glu Glu Glu Arg Tyr 
685 690 695 



gee ttg atg gat tec ctt gac cca aga aat aac ate ttt atg atg tea 
Ala Leu Met Asp Ser Leu Asp Pro Arg Asn Asn He Phe Met Met Ser 
715 720 725 730 



1683 



gac aag tac ttg att acc aca gtc ggt aag att ate ttt aat gaa att 1731 
Asp Lys Tyr Leu He Thr Thr Val Gly Lys He He Phe Asn Glu He 
555 560 565 570 

atg cca gca gaa ttt cca ttc ttg aac gaa cca tct aag gtt aac ctg 1779 
Met Pro Ala Glu Phe Pro Phe Leu Asn Glu Pro Ser Lys Val Asn Leu 

575 580 585 



1827 



1875 



1923 



gaa acc tct aaa atg ttg gac cgc atg aag aac ttg ggc tac aag tac 1971 
Glu Thr Ser Lys Met Leu Asp Arg Met Lys Asn Leu Gly Tyr Lys Tyr 
635 640 645 650 



2019 



2067 



2115 



gac aac gtt ate gat gtc tgg caa aag get aag gat gaa att caa gat 2163 
Asp Asn Val He Asp Val Trp Gin Lys Ala Lys Asp Glu He Gin Asp 
700 705 710 



2211 



gac tct ggt gee cgt ggg aat att tec aac ttc acc caa eta gec ggt 



2259 
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Asp Ser Gly Ala Arg Gly Asn lie Ser Asn Phe Thr Gin Leu Ala Gly 

735 740 745 

atg cgt ggt ttg atg gca gca cca agt ggt gag ate atg gaa ttg ccg 23 07 

Met Arg Gly Leu Met Ala Ala Pro Ser Gly Glu lie Met Glu Leu Pro 

750 755 760 

ate acg tct aac ttc cgt gaa ggc ctg tct gtc tta gag atg ttt att 2355 
He Thr Ser Asn Phe Arg Glu Gly Leu Ser Val Leu Glu Met Phe He 
765 770 775 

tec acc cac ggt gec cgt aaa ggc atg acc gat acc gec ctt aaa act 2403 
Ser Thr His Gly Ala Arg Lys Gly Met Thr Asp Thr Ala Leu Lys Thr 
780 785 790 

gec gac tct ggt tac ttg acc aga cgt ttg gtt gat gtt gec caa gac 2451 
Ala Asp Ser Gly Tyr Leu Thr Arg Arg Leu Val Asp Val Ala Gin Asp 
795 800 805 810 

gtc ate ate cga gaa gaa gac tgt ggc act aaa cgt ggc ctt aaa gtt 2499 
Val He He Arg Glu Glu Asp Cys Gly Thr Lys Arg Gly Leu Lys Val 

815 820 825 

tct gee ate caa gta gga aat gaa cag att gaa age ttg tct gac cgt 2 547 

Ser Ala He Gin Val Gly Asn Glu Gin He Glu Ser Leu Ser Asp Arg 

830 835 840 

ate ttg ggt cgt tat gec caa gaa acc gtc acc cac ccc gaa act ggt 2595 
He Leu Gly Arg Tyr Ala Gin Glu Thr Val Thr His Pro Glu Thr Gly 
845 850 855 

gaa gtc att gtt cac aag gat gaa ttg att gat gaa ggc aaa acc cga 2 643 

Glu Val He Val His Lys Asp Glu Leu He Asp Glu Gly Lys Thr Arg 
860 865 870 

aaa att gtc gat gec ggt att gaa gaa gtt act ate egg tct gee ttc 2691 
Lys He Val Asp Ala Gly He Glu Glu Val Thr He Arg Ser Ala Phe 
875 880 885 890 

tgc tgc aac acc aac cac ggt gtc tgc aag cac tgc tat ggc cgt aac 2739 
Cys Cys Asn Thr Asn His Gly Val Cys Lys His Cys Tyr Gly Arg Asn 

895 900 905 

ttg gca act ggc egg gaa gtt gaa gtt ggt gaa gca gtt gga act ate 2787 
Leu Ala Thr Gly Arg Glu Val Glu Val Gly Glu Ala Val Gly Thr He 

910 915 920 

get gee caa tec att ggg gaa ccc ggt acc caa ttg acc atg egg acc 2835 
Ala Ala Gin Ser He Gly Glu Pro Gly Thr Gin Leu Thr Met Arg Thr 
925 930 935 

ttc cac act ggt ggg gtc get ggg gac gac ate acc caa ggt eta cca 2883 
Phe His Thr Gly Gly Val Ala Gly Asp Asp He Thr Gin Gly Leu Pro 
940 945 950 

egg gtt caa gaa ate ttt gaa gec cgc cat ccg aaa ggg caa gee acc 2931 
Arg Val Gin Glu He Phe Glu Ala Arg His Pro Lys Gly Gin Ala Thr 



on 
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955 960 965 970 

att aca gaa gtg aat ggt caa ate caa gag ate gtt gaa gac cct gaa 2979 
lie Thr Glu Val Asn Gly Gin lie Gin Glu lie Val Glu Asp Pro Glu 

975 980 985 

gaa cgc act aag acc gtc act gtt aag ggg aat gtt gac caa cgt gac 3027 
Glu Arg Thr Lys Thr Val Thr Val Lys Gly Asn Val Asp Gin Arg Asp 

990 995 1000 

tac tec ttg cca ate aat gee egg atg aag gtt gaa gtt ggg gat tat 3 075 

Tyr Ser Leu Pro He Asn Ala Arg Met Lys Val Glu Val Gly Asp Tyr 
1005 1010 1015 

gtt gaa cga ggc gat get eta aac gag ggg tct att gat ccg aaa gag 3123 
Val Glu Arg Gly Asp Ala Leu Asn Glu Gly Ser He Asp Pro Lys Glu 
1020 1025 1030 

tta etc gcg gtg agt gat atg atg aaa ttg cag aaa tac etc ttg caa 3171 
Leu Leu Ala Val Ser Asp Met Met Lys Leu Gin Lys Tyr Leu Leu Gin 
1° 35 1040 1045 1050 

gaa gtc caa tac get tac egg tct caa ggg gtc gaa att ggt gac aag 3219 
Glu Val Gin Tyr Ala Tyr Arg Ser Gin Gly Val Glu lie Gly Asp Lys 

1055 1060 1065 

cac gtg gag gtt atg gtg cga caa atg etc cgt aaa gtc cgt gtc ttg 3267 
His Val Glu Val Met Val Arg Gin Met Leu Arg Lys Val Arg Val Leu 

1070 1075 1080 

caa cca ggg gac act gat ate ctg cct ggt acc atg att gac etc cac 3315 
Gin Pro Gly Asp Thr Asp He Leu Pro Gly Thr Met He Asp Leu His 
1085 1090 1095 

gac ttc aag gaa cgc aac caa gaa acc ttg atg tec ggt ggc caa ccc 33 63 

Asp Phe Lys Glu Arg Asn Gin Glu Thr Leu Met Ser Gly Gly Gin Pro 
1100 1105 1110 

gca act get aga ctg gtc eta ctg ggt att acc aag gec tec ctt gaa 3411 
Ala Thr Ala Arg Leu Val Leu Leu Gly He Thr Lys Ala Ser Leu Glu 
1115 1120 1125 1130 

acc aac tct ttc ttg tct gca get tec ttc caa gaa acc acc egg gtc 3459 
Thr Asn Ser Phe Leu Ser Ala Ala Ser Phe Gin Glu Thr Thr Arg Val 

1135 1140 1145 

etc acc gat gca get att cgc ggt aaa gtt gat gac ctg gtt ggc ttg 3507 
Leu Thr Asp Ala Ala lie Arg Gly Lys Val Asp Asp Leu Val Gly Leu 

1150 1155 * ^ 1160 

aaa gaa aat gtt att ate ggt aaa tec ate cca get ggt act ggt atg 3555 
Lys Glu Asn Val He He Gly Lys Ser He Pro Ala Gly Thr Gly Met 
1165 1170 1X75 

aga gee tac agt aat att gaa cct aaa aaa gtt ggt gtc gtt age gaa 3603 
Arg Ala Tyr Ser Asn He Glu Pro Lys Lys Val Gly Val Val Ser Glu 
H80 1185 H90 



9.1 
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aat gtc tac age ate aat gaa gaa gac caa gtc agt caa gaa gaa aac 3651 
Asn Val Tyr Ser lie Asn Glu Glu Asp Gin Val Ser Gin Glu Glu Asn 
1195 ~ 1200 1205 1210 



cga gaa act gaa gaa act age gag aaa taa 
Arg Glu Thr Glu Glu Thr Ser Glu Lys 

1215 



<210> 12 
<211> 1219 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 12 

Met Val Asp Val Asn Asn Phe Glu Ser lie Gin lie Gly Leu Ala Ser 
15 10 15 

Pro Glu Lys lie Arg Ser Trp Ser His Gly Glu Val Lys Lys Pro Glu 

20 25 3 0 



Thr lie Asn Tyr Arg Thr Leu Lys Pro Glu Lys Asp Gly Leu Phe Cys 
35 40 45 



Glu Arg lie Phe Gly Pro Thr Lys Asp Tyr Glu Cys Ala Cys Gly Lys 
50 55 60 



Tyr Lys Arg Val His Tyr Lys Gly lie Val Cys Asp Arg Cys Gly Val 
65 70 75 80 



Glu Val Thr Lys Ser Ser Val Arg Arg Glu Arg Met Gly His Leu Glu 

85 90 95 



Leu Ala Ala Pro Val Thr His lie Trp Tyr Phe Lys Gly He Pro Ser 

100 105 110 



Arg Met Gly Leu He Leu Asp Met Ser Pro Arg Ser Leu Glu Glu He 
115 . 120 125 



He Tyr Phe Ala Ser Tyr Val Val He Asp Gly Gly Asp Thr Pro Leu 
130 135 140 



Glu Arg Lys Gin Leu Leu Thr Glu Arg Glu Tyr Arg Glu Asn Lys Ser 
145 "* " 150 155 160 

Lys Tyr Gly Asn Glu Phe Gin Ala Glu He Gly Ala Glu Ala Val Arg 



3681 
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165 170 175 



Thr Leu Leu Lys Asn Val Asp Leu Glu Gin Glu Val Ala Asp Leu Lys 

180 185 190 



Glu lie Leu Glu Thr Ala Thr Gly Gin Lys Arg Thr Arg Ala lie Arg 
195 200 205 



Arg Leu Asp lie He Asp Ser Phe Lys Ser Ser Asn Asn Lys Pro Glu 
210 215 220 



Trp Met Val Leu Asp Ala He Pro He He Pro Pro Glu Leu Arg Pro 
225 230 235 240 



Met Val Gin Leu Glu Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp 

245 250 255 



Leu Tyr Arg Arg Val He Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu 

260 265 270 



Asp Leu Asn Ala Pro His He He Val Gin Asn Glu Lys Arg Met Leu 
275 280 285 



Gin Glu Ala Val Asp Ala Leu He Asp Asn Gly Arg Arg Gly Arg Ala 
290 295 300 



Val Asn Gly Pro Gly Asn Arg Pro Leu Lys Ser Leu Ser His Met Leu 
305 ~ 310 315 320 



Lys Gly Lys Gin Gly Arg Phe Arg Gin Asn Leu Leu Gly Lys Arg Val 

325 330 335 



Asp Tyr Ser Gly Arg Ser Val He Val Val Gly Pro Thr Leu Lys Met 

340 ~ 345 350 



Tyr Gin Cys Gly Leu Pro Lys Glu Met Ala lie Glu Leu Phe Lys Pro 
3 55 • 3 60 365 



Phe Val Met Arg Glu Leu Val Glu Arg Asp He Ala Asn Asn He Lys 
37 0 375 380 



Asn Ala Lys Arg Lys Val Glu Arg Met Glu Asp Asp Val Trp Pro Val 
385 390 395 400 
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Leu Glu Asp Val lie Lys Glu His Pro Val Leu Leu Asn Arg Ala Pro 

405 410 415 



Tbr Leu His Arg Leu Gly lie Gin Ala Phe Glu Pro Val Leu Val Asn 

420 425 430 



Gly Lys Ala lie Arg Leu His Pro Leu Ala Cys Glu Ala Tyr Asn Ala 
435 440 445 



Asp Phe Asp Gly Asp Gin Met Ala Val His Val Pro Leu Ser Asp Glu 
450 455 460 



Ala Gin Ala Glu Ala Arg lie Leu Met Leu Gly Ala Gin Asn lie Leu 

465 470 475 480 

Asn Pro Lys Asp Gly Gin Pro Val Val Thr Pro Ser Gin Asp Met Val 

485 490 495 



Leu Gly Asn Tyr Tyr Leu Thr Met Glu Glu Glu Gly Lys lie Gly Glu 

500 505 510 



Gly Thr Val Phe Ser Ser Ala Ser Glu Ala He Gin Ala Tyr Gin Thr 
515 520 525 



Gly Tyr Val His Leu His Thr Arg Val Ala He Arg Ala Val Asp Leu 
530 535 540 



Pro Asp Lys Pro Phe Thr Asp Trp Gin Lys Asp Lys Tyr Leu He Thr 
545 " 550 555 560 



Thr Val Gly Lys He He Phe Asn Glu He Met Pro Ala Glu Phe Pro 

565 570 575 



Phe Leu Asn Glu Pro Ser Lys Val Asn Leu Glu Gin Gin Thr Pro Asp 

580 585 590 



Lys Tyr Phe Val Asp Arg Gly Gin Asn Leu Lys Asp Leu He Ala Asp 
595 600 605 



Arg Pro Leu Val Gin Pro Phe Lys Lys Gin Asp Leu Ser Asn He He 
610 615 620 
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Ala Glu Val Phe Asn Asn Phe Gin Val Thr Glu Thr Ser Lys Met Leu 
625 630 635 640 



Asp Arg Met Lys Asn Leu Gly Tyr Lys Tyr Ser Thr Arg Ser Gly He 

645 650 655 



Thr Val Gly He Ala Asp Val Ser Val Leu Glu Ala Lys Pro Glu He 

660 665 670 



Leu Lys Glu Ala His Ala Lys Val Asp Lys He Asn Ala Thr His Arg 
675 680 685 



Arg Gly Leu He Thr Glu Glu Glu Arg Tyr Asp Asn Val He Asp Val 
690 695 700 



Trp Gin Lys Ala Lys Asp Glu He Gin Asp Ala Leu Met Asp Ser Leu 
705 " 710 715 720 



Asp Pro Arg Asn Asn He Phe Met Met Ser Asp Ser Gly Ala Arg Gly 

725 730 735 



Asn He Ser Asn Phe Thr Gin Leu Ala Gly Met Arg Gly Leu Met Ala 

740 745 750 



Ala Pro Ser Gly Glu He Met Glu Leu Pro He Thr Ser Asn Phe Arg 
755 760 765 



Glu Gly Leu Ser Val Leu Glu Met Phe He Ser Thr His Gly Ala Arg 
770 775 780 



Lys Gly Met Thr Asp Thr Ala Leu Lys Thr Ala Asp Ser Gly Tyr Leu 
785 790 795 800 



Thr Arg Arg Leu Val Asp Val Ala Gin Asp Val He He Arg Glu Glu 

805 810 815 



Asp Cys Gly Thr Lys Arg Gly Leu Lys Val Ser Ala He Gin Val Gly 

820 825 830 



Asn Glu Gin He Glu Ser Leu Ser Asp Arg He Leu Gly Arg Tyr Ala 
835 840 845 
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Gin Glu Thr Val Thr His Pro Glu Thr Gly Glu Val lie Val His Lys 
850 855 860 



Asp Glu Leu lie Asp Glu Gly Lys Thr Arg Lys lie Val Asp Ala Gly 
865 870 875 880 



He Glu Glu Val Thr He Arg Ser Ala Phe Cys Cys Asn Thr Asn His 

885 890 895 



Gly Val Cys Lys His Cys Tyr Gly Arg Asn Leu Ala Thr Gly Arg Glu 

900 905 910 



Val Glu Val Gly Glu Ala Val Gly Thr He Ala Ala Gin Ser He Gly 
915 920 925 



Glu Pro Gly Thr Gin Leu Thr Met Arg Thr Phe His Thr Gly Gly Val 
930 935 940 



Ala Gly Asp Asp He Thr Gin Gly Leu Pro Arg Val Gin Glu He Phe 
945 ** " 950 955 960 



Glu Ala Arg His Pro Lys Gly Gin Ala Thr He Thr Glu Val Asn Gly 

965 970 975 



Gin He Gin Glu He Val Glu Asp Pro Glu Glu Arg Thr Lys Thr Val 

980 985 990 



Thr Val Lys Gly Asn Val Asp Gin Arg Asp Tyr Ser Leu Pro He Asn 
995 1000 1005 



Ala Arg Met Lys Val Glu Val Gly Asp Tyr Val Glu Arg Gly Asp Ala 
1010 1015 1020 



Leu Asn Glu Gly Ser He Asp Pro Lys Glu Leu Leu Ala Val Ser Asp 
1025 1030 1035 1040 



Met Met Lys Leu Gin Lys Tyr Leu Leu Gin Glu Val Gin Tyr Ala Tyr 

1045 1050 1055 



Arg Ser Gin Gly Val Glu He Gly Asp Lys His Val Glu Val Met Val 

1060 1065 1070 



Arg Gin Met Leu Arg Lys Val Arg Val Leu Gin Pro Gly Asp Thr Asp 
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1075 1080 1085 

He I»eu Pro Gly Thr Met He Asp Leu His Asp Phe Lys Glu Arg Asn 
1090 1095 1100 

Gin Glu Thr Leu Met Ser Gly Gly Gin Pro Ala Thr Ala Arg Leu Val 
1105 1110 1115 1120 

Leu Leu Gly He Thr Lys Ala Ser Leu Glu Thr Asn Ser Phe Leu Ser 

1125 1130 1135 



Ala Ala Ser Phe Gin Glu Thr Thr Arg Val Leu Thr Asp Ala Ala He 

1140 1145 1150 

Arg Gly Lys Val Asp Asp Leu Val Gly Leu Lys Glu Asn Val He He 
1155 1160 1165 

Gly Lys Ser He Pro Ala Gly Thr Gly Met Arg Ala Tyr Ser Asn He 
1170 1175 1180 

Glu Pro Lys Lys Val Gly Val Val Ser Glu Asn Val Tyr Ser He Asn 
1185 H90 H95 1200 

Glu Glu Asp Gin Val Ser Gin Glu Glu Asn Arg Glu Thr Glu Glu Thr 

1205 1210 1215 



Ser Glu Lys 



<210> 13 
<211> 3582 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (3582) 

<223> 

<400> 13 

gag gtg aac aag ttg gtc ggt aaa aaa gtt aat ttt ggt aaa cac cgt 

Met Asn Lys Leu Val Gly Lys Lys Val Asn Phe Gly Lys His Arg 
1 5 10 15 

gtt cgt aga agt tac tea cga ate aac gaa gta etc gag etc ccg aat 
Val Arg Arg Ser Tyr Ser Arg He Asn Glu Val Leu Glu Leu Pro Asn 

20 25 30 



48 



96 
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tta att gaa ate cag act gat tea tat gat tgg ttt tta gat gaa ggc 
Leu lie Glu He Gin Thr Asp Ser Tyr Asp Trp Phe Leu Asp Glu Gly 

35 40 45 

ttg aag gaa atg ttt agt gat att tec cca ate gat gat ttt tea ggc 
Leu Lys Glu Met Phe Ser Asp He Ser Pro He Asp Asp Phe Ser Gly 
50 55 60 

aat ttg tec eta gaa ttt gtt gac tat aaa ttt tac gaa age aag tat 
Asn Leu Ser Leu Glu Phe Val Asp Tyr Lys Phe Tyr Glu Ser Lys Tyr 
65 70 75 

act gtt gaa gaa get aga gag cat gat gcg aac tat tct gee ccc etc 
Thr Val Glu Glu Ala Arg Glu His Asp Ala Asn Tyr Ser Ala Pro Leu 
80 85 90 95 

tac gtg aag tta cgt ttg ate aac aag gaa act ggt gaa gtc aag gaa 
Tyr Val Lys Leu Arg Leu He Asn Lys Glu Thr Gly Glu Val Lys Glu 

100 105 HO 

caa gaa gtc ttc ttc ggt gac ttt ccg tta atg aca gaa caa ggg ace 
Gin Glu Val Phe Phe Gly Asp Phe Pro Leu Met Thr Glu Gin Gly Thr 

115 120 125 

ttt ate ate aac ggg get gag egg gtg att gtt tec caa ctt gtc egg 
Phe He He Asn Gly Ala Glu Arg Val He Val Ser Gin Leu Val Arg 
130 135 140 

teg cct ggg gtt tat tac agt cca aaa gtt gag aaa aac ggc egg gaa 
Ser Pro Gly Val Tyr Tyr Ser Pro Lys Val Glu Lys Asn Gly Arg Glu 
145 . 150 155 

ggt ttt tea ace gtt ctt ate cct aac egg ggt get tgg ctt gaa tac 
Gly Phe Ser Thr Val Leu He Pro Asn Arg Gly Ala Trp Leu Glu Tyr 
160 165 170 175 

gaa aca gat ace aaa ggc ate tec aat gtt cga att gac cga acc cgt 
Glu Thr Asp Thr Lys Gly He Ser Asn Val Arg He Asp Arg Thr Arg 

180 185 190 

aaa att ccg- ate act gtc ttg tta aga get eta ggg att ggg tea gat 
Lys He Pro He Thr Val Leu Leu Arg Ala Leu Gly He Gly Ser Asp 

195 200 205 

gat gaa att att gac ctg ate ggc ttg aat gac age ttg gaa gee acc 
Asp Glu He He Asp Leu He Gly Leu Asn Asp Ser Leu Glu Ala Thr 
210 215 220 

ttg gaa aag gat gtc cac aag tct act tea gat tec cgc gta gaa gaa 
Leu Glu Lys Asp Val His Lys Ser Thr Ser Asp Ser Arg Val Glu Glu 
225 230 235 

gee ttg aaa gac ttg tat gaa cgc ttg cgt cca ggt gaa ccc aaa aca 
Ala Leu Lys Asp Leu Tyr Glu Arg Leu Arg Pro Gly Glu Pro Lys Thr 
240 245 250 255 

get gaa tec tct cgt aac ttg ate aat acc egg ttc ttt gac cac aag 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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Ala Glu Ser Ser Arg Asn Leu lie Asn Thr Arg Phe Phe Asp His Lys 

260 265 270 

cgt tac gac eta gec tat gtt ggt cgc tac aag atg aac aaa aaa eta 
Arg Tyr Asp Leu Ala Tyr Val Gly Arg Tyr Lys Met Asn Lys Lys Leu 

275 280 285 

gac ctt aaa acc cgc ttg atg ggg act gtc ctt gec gaa aac ctg gtt 
Asp Leu Lys Thr Arg Leu Met Gly Thr Val Leu Ala Glu Asn Leu Val 
290 295 300 

gat cct gaa get ggc gag gtc tta get gaa gaa ggt agt gaa gtg acc 
Asp Pro Glu Ala Gly Glu Val Leu Ala Glu Glu Gly Ser Glu Val Thr 
305 310 315 

egg tct gtg atg gac aag ctt ggc cct tac ctt gac ggt gac atg aac 
Arg Ser Val Met Asp Lys Leu Gly Pro Tyr Leu Asp Gly Asp Met Asn 
320 325 330 335 



gac eta caa att gtc aaa gtc tac tec aaa gaa gat cca gac egg ate 
Asp Leu Gin He Val Lys Val Tyr Ser Lys Glu Asp Pro Asp Arg lie 

355 360 365 



acc cct get gac atg ata gcg get atg agt tac ttc ttt aac etc caa 
Thr Pro Ala Asp Met He Ala Ala Met Ser Tyr Phe Phe Asn Leu Gin 
385 390 395 



ate egg tea gtc gga gag ctt ttg caa aac caa ttc cga att ggg etc 
He Arg Ser Val Gly Glu Leu Leu Gin Asn Gin Phe Arg lie Gly Leu 

420 425 430 



tct age acc aca ccc caa caa tta att aac ate cgt ccc gtt gta get 
Ser Ser Thr Thr Pro Gin Gin Leu He Asn He Arg Pro Val Val Ala 
450 455 460 



864 



912 



960 



1008 



caa gta acc att aac ccc tea gaa gaa gcg gtt ate cct gaa ccc att 1056 
Gin Val Thr He Asn Pro Ser Glu Glu Ala Val He Pro Glu Pro He 

340 345 350 



1104 



gtg aac atg ate ggc aac ggg cac cct gac aaa aag gee aaa tgg att 1152 
Val Asn Met He Gly Asn Gly His Pro Asp Lys Lys Ala Lys Trp He 
370 375 380 



1200 



gaa ggc att ggc gat gtt gac gat ate gac cac ttg ggt aac cgt egg 1248 
Glu Gly He Gly Asp Val Asp Asp He Asp His Leu Gly Asn Arg Arg 
400 405 410 415 



1296 



tct egg atg gag egg gtg gtc cgc gaa cga atg tec ate caa gac att 1344 
Ser Arg Met Glu Arg Val Val Arg Glu Arg Met Ser He Gin Asp He 

435 440 445 



1392 



tct ctg aaa gaa ttt ttc ggg tct tec caa etc tec caa ttc atg gac 1440 
Ser Leu Lys Glu Phe Phe Gly Ser Ser Gin Leu Ser Gin Phe Met Asp 
465 470 475 



caa acc aac ccc ttg ggt gag tta acc cac aaa cgt cgc ttg tct gec 
Gin Thr Asn Pro Leu Gly Glu Leu Thr His Lys Arg Arg Leu Ser Ala 



1488 
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480 



485 



490 



495 



ctt gga cca gga ggc ttg act agg gac egg get ggt tat gaa gtc cga 
Leu Gly Pro Gly Gly Leu Thr Arg Asp Arg Ala Gly Tyr Glu Val Arg 

500 505 510 

gac gtc cac tat tec cac tac ggc egg atg tgc ccg ate gaa aca cct 
Asp Val His Tyr Ser His Tyr Gly Arg Met Cys Pro lie Glu Thr Pro 

515 520 525 

gaa ggc cca aac att ggt ctg att aac agt ttg tct acc tat get aag 
Glu Gly Pro Asn He Gly Leu He Asn Ser Leu Ser Thr Tyr Ala Lys 
530 535 540 

ate aat aaa ttt ggt ttt att gaa aca cct tac cgc egg gtg gac egg 
He Asn Lys Phe Gly Phe He Glu Thr Pro Tyr Arg Arg Val Asp Arg 
545 550 555 

gaa act ggc cag gta acg gat aaa att gac tac ttg act get gac gaa 
Glu Thr Gly Gin Val Thr Asp Lys He Asp Tyr Leu Thr Ala Asp Glu 
560 565 570 575 

gaa gat ctt tac gtt gta gee caa gee aat get gaa tta gat gaa gat 
Glu Asp Leu Tyr Val Val Ala Gin Ala Asn Ala Glu Leu Asp Glu Asp 

580 585 590 

gga cat ttt get aat gat gtc gtc eta gee cga aga egg gat gtc aac 
Gly His Phe Ala Asn Asp Val Val Leu Ala Arg Arg Arg Asp Val Asn 

595 600 605 

gaa gag gtt gac get tec gaa gtt gac tat atg gac gtg tea cca aaa 
Glu Glu Val Asp Ala Ser Glu Val Asp Tyr Met Asp Val Ser Pro Lys 
610 615 620 

caa gtt gtt tct gtg gee aca get tec att cct ttc tta gaa aac gac 
Gin Val Val Ser Val Ala Thr Ala Ser He Pro Phe Leu Glu Asn Asp 
625 630 635 

gac tec aac egg get eta atg ggg get aac atg caa egg caa get gtt 
Asp Ser Asn Arg Ala Leu Met Gly Ala Asn Met Gin Arg Gin Ala Val 
640 645 650 655 

cct ctt atg caa cca gag tec cca eta gta gga act gga ate gaa cac 
Pro Leu Met Gin Pro Glu Ser Pro Leu Val Gly Thr Gly He Glu His 

660 665 670 

att gca gee cgt gac tct gga get gee gtt att gee aag get gac ggg 
He Ala Ala Arg Asp Ser Gly Ala Ala Val He Ala Lys Ala Asp Gly 

675 680 685 

gtt gtg gag tat gtt gat gee aag acg gtc aaa gtc cgt caa gec gat 
Val Val Glu Tyr Val Asp Ala Lys Thr Val Lys Val Arg Gin Ala Asp 
690 695 700 

ggt acc etc aac aac tac aag ctg get aag tac aaa egg tec aac tec 
Gly Thr Leu Asn Asn Tyr Lys Leu Ala Lys Tyr Lys Arg Ser Asn Ser 
705 710 715 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



r 
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gga act tct tac aac caa aga cca att gta aaa act ggt gag gaa gtt 
Gly Thr Ser Tyr Asn Gin Arg Pro He Val Lys Thr Gly Glu Glu Val 
720 725 730 735 

gac aaa ggc gac ate eta gca gat ggt ccg tec atg gaa aat ggt gaa 
Asp Lys Gly Asp He Leu Ala Asp Gly Pro Ser Met Glu Asn Gly Glu 

740 745 750 

atg gec ctt ggt aaa aac cca ttg att gee ttt ace acc ttt gat ggc 
Met Ala Leu Gly Lys Asn Pro Leu He Ala Phe Thr Thr Phe Asp Gly 

755 760 765 

tac aac ttc gag gat gee gtc att atg agt gaa cgt ttg gtc aaa gat 
Tyr Asn Phe Glu Asp Ala Val He Met Ser Glu Arg Leu Val Lys Asp 
770 775 780 

gac gtt tat acc tec ate cac att gaa gaa tat gaa tct gaa gec cgc 
Asp Val Tyr Thr Ser He His He Glu Glu Tyr Glu Ser Glu Ala Arg 
785 790 795 

gat acc aag tta ggg cca gaa gaa ate acc egg gaa att cca aac gtc 
Asp Thr Lys Leu Gly Pro Glu Glu He Thr Arg Glu He Pro Asn Val 
800 "* 805 810 815 

ggg gaa agt gee etc aag aac ttg gat gaa aga ggc att ate egg ate 
Gly Glu Ser Ala Leu Lys Asn Leu Asp Glu Arg Gly He He Arg lie 

820 825 830 

ggg get gaa gtt cgt gac ggg gac ate eta gtt ggt aaa gtt aca ccc 
Gly Ala Glu Val Arg Asp Gly Asp He Leu Val Gly Lys Val Thr Pro 

835 840 845 

aaa ggg gtt agt gaa eta tea get gag gaa aaa etc etc cac get ate 
Lys Gly Val Ser Glu Leu Ser Ala Glu Glu Lys Leu Leu His Ala He 
850 855 860 

ttc ggc gaa aaa gee egg gaa gtt cgt gac acc tec etc cgt gtc cca 
Phe Gly Glu Lys Ala Arg Glu Val Arg Asp Thr Ser Leu Arg Val Pro 
865 870 875 

cac ggt agt ggc gga att gtc cac gat gtc cag ate ttt acc egg gaa 
His Gly Ser Gly Gly He Val His Asp Val Gin He Phe Thr Arg Glu 
880 ^ 885 890 895 

gec ggc gac gaa ctg tea cca ggc gtt aac tac ctt gtc cga gtt ttc 
Ala Gly Asp Glu Leu Ser Pro Gly Val Asn Tyr Leu Val Arg Val Phe 

900 905 910 

att gec caa aaa cgt aaa att gac gtt ggg gac aag atg gca ggt cga 
He Ala Gin Lys Arg Lys He Asp Val Gly Asp Lys Met Ala Gly Arg 

915 920 925 

cac ggg aac aag ggt gtt gtt tec ctt ate tta cca gaa gaa gac atg 
His Gly Asn Lys Gly Val Val Ser Leu He Leu Pro Glu Glu Asp Met 
930 935 940 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



2784 



2832 
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ccg ttt atg cca gac gga acc cca att gac ate atg etc aac cca ctt 2880 
Pro Phe Met Pro Asp Gly Thr Pro lie Asp lie Met Leu Asn Pro Leu 
945 950 955 

ggt gtc cct tec egg atg aat gtc ggc cag gtc ate gaa etc cac atg 2928 
Gly Val Pro Ser Arg Met Asn Val Gly Gin Val lie Glu Leu His Met 
960 965 970 975 

ggg atg gca gee cga cag tta ggc gag cac att get act cca gtc ttt 297 6 

Gly Met Ala Ala Arg Gin Leu Gly Glu His lie Ala Thr Pro Val Phe 

980 985 990 



gac ggg gee aac gaa gaa gat gtt tgg gaa act ate aag gaa gee ggt 
Asp Gly Ala Asn Glu Glu Asp Val Trp Glu Thr lie Lys Glu Ala Gly 

995 1000 1005 



gga agg gta gac acc tat gaa gee att gtc aag ggc -caa cgc att cca 
Gly Arg Val Asp Thr Tyr Glu Ala He Val Lys Gly Gin Arg He Pro 
1105 ~ 1110 1115 



3024 



atg gat gca gat gec aaa acc gtc ttg tat gac ggc egg act ggc gag 3072 
Met Asp Ala Asp Ala Lys Thr Val Leu Tyr Asp Gly Arg Thr Gly Glu 
1010 1015 1020 

cca ttt gac aac aag gtc tec gtt ggg gtg atg tac ttt ate aaa eta 3120 
Pro Phe Asp Asn Lys Val Ser Val Gly Val Met Tyr Phe He Lys Leu 
1025 1030 1035 

gtc cac atg gtc gac gac aag ttg cac gee aga tec aca gga cca tac 3168 
Val His Met Val Asp Asp Lys Leu His Ala Arg Ser Thr Gly Pro Tyr 
1040 1045 1050 1055 

tec ttg gtt acc caa caa ccc ctt ggt ggg aaa gca cag ttt ggt ggc 3216 
Ser Leu Val Thr Gin Gin Pro Leu Gly Gly Lys Ala Gin Phe Gly Gly 

1060 1065 1070 

caa cgc ttt ggt gag atg gaa gtc tgg gec ttg gaa get tat ggg get 3264 
Gin Arg Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala 

1075 1080 1085 

tec cgc acc etc caa gaa ate ttg acc tac aag tea gat gac gtg att 3312 
Ser Arg Thr Leu Gin Glu He Leu Thr Tyr Lys Ser Asp Asp Val He 
1090 1095 1100 



3360 



aaa cct ggt gta cct gaa tec ttc cgt gtc etc gtg aaa gaa etc cag 3408 
Lys Pro Gly Val Pro Glu Ser Phe Arg Val Leu Val Lys Glu Leu Gin 
1120 1125 1130 1135 

tct ctg ggg ttg gac ctg aaa gtc etc gac aag gaa caa aac gaa ate 3456 
Ser Leu Gly Leu Asp Leu Lys Val Leu Asp Lys Glu Gin Asn Glu lie 

1140 1145 1150 

aat etc aag get gaa gat gac gag teg gaa gac caa gtc gtt gat tec 3504 
Asn Leu Lys Ala Glu Asp Asp Glu Ser Glu Asp Gin Val Val Asp Ser 

1155 1160 1165 

eta gaa gaa atg cgt aaa gag cag gaa gaa gaa cgc cgt aag gaa aaa 3552 
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Leu Glu Glu Met Arg Lys Glu Gin Glu Glu Glu Arg Arg Lys Glu Lys 
1170 1175 1180 

gaa aaa gaa gag cca agt act gag teg taa 3582 
Glu Lys Glu Glu Pro Ser Thr Glu Ser 
1185 1190 



<210> 14 
<211> 1192 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 14 

Met Asn Lys Leu Val Gly Lys Lys Val Asn Phe Gly Lys His Arg Val 
15 10 15 



Arg Arg Ser Tyr Ser Arg lie Asn Glu Val Leu Glu Leu Pro Asn Leu 

20 25 30 



lie Glu He Gin Thr Asp Ser Tyr Asp Trp Phe Leu Asp Glu Gly Leu 
35 40 45 



Lys Glu Met Phe Ser Asp He Ser Pro lie Asp Asp Phe Ser Gly Asn 
50 55 60 



Leu Ser Leu Glu Phe Val Asp Tyr Lys Phe Tyr Glu Ser Lys Tyr Thr 
65 70 75 80 



Val Glu Glu Ala Arg Glu His Asp Ala Asn Tyr Ser Ala Pro Leu Tyr 

85 90 95 



Val Lys Leu Arg Leu He Asn Lys Glu Thr Gly Glu Val Lys Glu Gin 

100 105 110 



Glu Val Phe Phe Gly Asp Phe Pro Leu Met Thr Glu Gin Gly Thr Phe 
115 120 125 



He He Asn Gly Ala Glu Arg Val He Val Ser Gin Leu Val Arg Ser 
130 135 140 



Pro Gly Val Tyr Tyr Ser Pro Lys Val Glu Lys Asn Gly Arg Glu Gly 
145 ~ 150 155 160 



Phe Ser Thr Val Leu He Pro Asn Arg Gly Ala Trp Leu Glu Tyr Glu 

165 170 175 
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Thr Asp Thr Lys Gly lie Ser Asn Val Arg lie Asp Arg Thr Arg Lys 

180 185 190 



lie Pro lie Thr Val Leu Leu Arg Ala Leu Gly lie Gly Ser Asp Asp 
195 200 205 



Glu lie lie Asp Leu lie Gly Leu Asn Asp Ser Leu Glu Ala Thr Leu 
210 215 220 



Glu Lys Asp Val His Lys Ser Thr Ser Asp Ser Arg Val Glu Glu Ala 
225 230 235 240 



Leu Lys Asp Leu Tyr Glu Arg Leu Arg Pro Gly Glu Pro Lys Thr Ala 

245 250 255 



Glu Ser Ser Arg Asn Leu lie Asn Thr Arg Phe Phe Asp His Lys Arg 

260 265 270 



Tyr Asp Leu Ala Tyr Val Gly Arg Tyr Lys Met Asn Lys Lys Leu Asp 
275 280 285 



Leu Lys Thr Arg Leu Met Gly Thr Val Leu Ala Glu Asn Leu Val Asp 
290 295 300 



Pro Glu Ala Gly Glu Val Leu Ala Glu Glu Gly Ser Glu Val Thr Arg 
305 310 315 320 



Ser Val Met Asp Lys Leu Gly Pro Tyr Leu Asp Gly Asp Met Asn Gin 

325 330 335 



Val Thr lie Asn Pro Ser Glu Glu Ala Val lie Pro Glu Pro lie Asp 

340 345 350 



Leu Gin lie Val Lys Val Tyr Ser Lys Glu Asp Pro Asp Arg lie Val 
355 360 365 



Asn Met lie Gly Asn Gly His Pro Asp Lys Lys Ala Lys Trp lie Thr 
370 ~ 375 380 



Pro Ala Asp Met He Ala Ala Met Ser Tyr Phe Phe Asn Leu Gin Glu 
385 390 395 400 
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Gly He Gly Asp Val Asp Asp He Asp His Leu Gly Asn Arg Arg He 

405 410 415 



Arg Ser Val Gly Glu Leu Leu Gin Asn Gin Phe Arg He Gly Leu Ser 

420 .425 430 



Arg Met Glu Arg Val Val Arg Glu Arg Met Ser He Gin Asp He Ser 
435 ■ 440 445 



Ser Thr Thr Pro Gin Gin Leu He Asn He Arg Pro Val Val Ala Ser 
450 455 460 



Leu Lys Glu Phe Phe Gly Ser Ser Gin Leu Ser Gin Phe Met Asp Gin 
465 470 • 475 480 



Thr Asn Pro Leu Gly Glu Leu Thr His Lys Arg Arg Leu Ser Ala Leu 

485 490 495 



Gly Pro Gly Gly Leu Thr Arg Asp Arg Ala Gly Tyr Glu Val Arg Asp 

500 505 510 



Val His Tyr Ser His Tyr Gly Arg Met Cys Pro He Glu Thr Pro Glu 
515 520 525 



Gly Pro Asn He Gly Leu He Asn Ser Leu Ser Thr Tyr Ala Lys He 
530 535 540 



Asn Lys Phe Gly Phe He Glu Thr Pro Tyr Arg Arg Val Asp Arg Glu 
545 550 555 560 



Thr Gly Gin Val Thr Asp Lys He Asp Tyr Leu Thr Ala Asp Glu Glu 

565 570 575 



Asp Leu Tyr Val Val Ala Gin Ala Asn Ala Glu Leu Asp Glu Asp Gly 

580 585 590 



His Phe Ala Asn Asp Val Val Leu Ala Arg Arg Arg Asp Val Asn Glu 
595 600 605 



Glu Val Asp Ala Ser Glu Val Asp Tyr Met Asp Val Ser Pro Lys Gin 
610 615 620 



Val Val Ser Val Ala Thr Ala Ser He Pro Phe Leu Glu Asn Asp Asp 



625 
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630 635 640 



Ser Asn Arg Ala Leu Met Gly Ala Asn Met Gin Arg Gin Ala Val Pro 

645 650 655 

Leu Met Gin Pro Glu Ser Pro Leu Val Gly Thr Gly lie Glu His lie 

660 665 670 

Ala Ala Arg Asp Ser Gly Ala Ala Val lie Ala Lys Ala Asp Gly Val 
675 ' 680 685 

Val Glu Tyr Val Asp Ala Lys Thr Val Lys Val Arg Gin Ala Asp Gly 
690 695 700 

Thr Leu Asn Asn Tyr Lys Leu Ala Lys Tyr Lys Arg Ser Asn Ser Gly 
705 710 715 720 

Thr Ser Tyr Asn Gin Arg Pro He Val Lys Thr Gly Glu Glu Val Asp 

725 730 735 

Lys Gly Asp lie Leu Ala Asp Gly Pro Ser Met Glu Asn Gly Glu Met 

740 745 750 



Ala Leu Gly Lys Asn Pro Leu He Ala Phe Thr Thr Phe Asp Gly Tyr 
755 760 765 

Asn Phe Glu Asp Ala Val He Met Ser Glu Arg Leu Val Lys Asp Asp 
770 775 780 

Val Tyr Thr Ser lie His He Glu Glu Tyr Glu Ser Glu Ala Arg Asp 
785 790 795 800 

Thr Lys Leu Gly Pro Glu Glu He Thr Arg Glu lie Pro Asn Val Gly 

805 810 815 



Glu Ser Ala Leu Lys Asn Leu Asp Glu Arg Gly He He Arg He Gly 

820 825 830 



Ala Glu Val Arg Asp Gly Asp Xle Leu Val Gly Lys Val Thr Pro Lys 
835 840 845 



Gly Val Ser Glu Leu Ser Ala Glu Glu Lys Leu Leu His Ala lie Phe 
850 855 860 
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Gly Glu Lys Ala Arg Glu Val Arg Asp Thr Ser Leu Arg Val Pro His 
865 870 875 880 

Gly Ser Gly Gly lie Val His Asp Val Gin lie Phe Thr Arg Glu Ala 

885 890 895 



Gly Asp Glu Leu Ser Pro Gly Val Asn Tyr Leu Val Arg Val Phe lie 

900 905 910 



Ala Gin Lys Arg Lys lie Asp Val Gly Asp Lys Met Ala Gly Arg His 
915 920 925 



Gly Asn Lys Gly Val Val Ser Leu lie Leu Pro Glu Glu Asp Met Pro 
930 935 940 

Phe Met Pro Asp Gly Thr Pro He Asp lie Met Leu Asn Pro Leu Gly 
945 950 955 960 

Val Pro Ser Arg Met Asn Val Gly Gin Val He Glu Leu His Met Gly 

965 970 975 



Met Ala Ala Arg Gin Leu Gly Glu His He Ala Thr Pro Val Phe Asp 

980 985 990 



Gly Ala Asn Glu Glu Asp Val Trp Glu Thr He Lys Glu Ala Gly Met 
995 1000 1005 



Asp Ala Asp Ala Lys Thr Val Leu Tyr Asp Gly Arg Thr Gly Glu Pro 
1010 1015 1020 

Phe Asp Asn Lys Val Ser Val Gly Val Met Tyr Phe He Lys Leu Val 
1025 1030 1035 1040 



His Met Val Asp Asp Lys Leu His Ala Arg Ser Thr Gly Pro Tyr Ser 

1045 1050 1055 



Leu Val Thr Gin Gin Pro Leu Gly Gly Lys Ala Gin Phe Gly Gly Gin 

1060 1065 1070 



Arg Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala Ser 
1075 1080 1085 
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Arg Thr Leu Gin Glu lie Leu Thr Tyr Lys Ser Asp Asp Val lie Gly 
1090 1095 1100 

Arg Val Asp Thr Tyr Glu Ala lie Val Lys Gly Gin Arg He Pro Lys 
1105 1110 1120 

Pro Gly Val Pro Glu Ser Phe Arg Val Leu Val Lys Glu Leu Gin Ser 

1125 1130 1135 

Leu Gly Leu Asp Leu Lys Val Leu Asp Lys Glu Gin. Asn Glu He Asn 

1140 1145 1150 

Leu Lys Ala Glu Asp Asp Glu Ser Glu Asp Gin Val Val Asp Ser Leu 
1155 H60 1165 

Glu Glu Met Arg Lys Glu Gin Glu Glu Glu Arg Arg Lys Glu Lys Glu 
1170 1175 1180 



Lys Glu Glu Pro Ser Thr Glu Ser 
1185 1190 



<210> 15 
<211> 1407 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (1407) 

<223> 

<400> 15 

aaagaccagg aaaggaagaa gacc ttg gca act aat att cat gaa gac cgc 

Met Ala Thr Asn He His Glu Asp Arg 
1 5 

ctg cca cca caa aat att gaa gcg gag caa tec gtc tta ggg tec gtc 
Leu Pro Pro Gin Asn He Glu Ala Glu Gin Ser Val Leu Gly Ser Val 
10 15 20 25 



51 



99 



etc tta aat gca gaa gec ttg gtg gcg gec atg gaa tat gtg gat gaa 147 
Leu Leu Asn Ala Glu Ala Leu Val Ala Ala Met Glu Tyr Val Asp Glu 

30 35 40 

gat gac ttt tac egg egg gec cac cag ttg ate ttt aag gec atg ata 195 
Asp Asp Phe Tyr Arg Arg Ala His Gin Leu He Phe Lys Ala Met He 

45 50 55 



gac 



etc tat gaa gac aac cag gec att gat gtc att acc att aaa gac 



243 
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Asp Leu Tyr Glu Asp Asn Gin Ala He Asp Val He Thr He Lys Asp 
60 65 70 

aag ctg gaa gcc aat gac cag ttg gag gat ate ggg ggt gec tct tac 
Lys Leu Glu Ala Asn Asp Gin Leu Glu Asp He Gly Gly Ala Ser Tyr 
75 80 85 

eta get gag att get ggg gtc ace cca ace gca get aac gtg tec tat 
Leu Ala Glu He Ala Gly Val Thr Pro Thr Ala Ala Asn Val Ser Tyr 
90 95 100 105 

tac get aag att gtg gaa gat egg tct ctt ttg cgc aac ttg att gcg 
Tyr Ala Lys He Val Glu Asp Arg Ser Leu Leu Arg Asn Leu He Ala 

110 115 120 

aca get aat gag att gcc cag tct ggc tac gaa gac cat gac gat gtg 
Thr Ala Asn Glu He Ala Gin Ser Gly Tyr Glu Asp His Asp Asp Val 

125 130 135 

cca gaa gtt tta aac aat get gag cag aag ate ttg cag gtt tct gaa 
Pro . Glu Val Leu Asn Asn Ala Glu Gin Lys He Leu Gin Val Ser Glu 
140 145 150 



ace ate gag cat att gat gaa etc cac caa agg gat gaa gag ate ace 
Thr He Glu His He Asp Glu Leu His Gin Arg Asp Glu Glu He Thr 
170 175 180 185 

ggg att tea act ggc tac ccc tac ctg gac agg atg act tea ggc ctt 
Gly He Ser Thr Gly Tyr Pro Tyr Leu Asp Arg Met Thr Ser Gly Leu 

190 195 200 

cat gaa gat gag ttg att att gtc gca gca aga ccg ggt gtg ggg aaa 
His Glu Asp Glu Leu He He Val Ala Ala Arg Pro Gly Val Gly Lys 

205 210 215 

acg get ttt gcc ttg aat gtc gcc caa aat ate ggg aca gcc aca gat 
Thr Ala Phe Ala Leu Asn Val Ala Gin Asn He Gly Thr Ala Thr Asp 
220 225 230 



aac egg atg tta tgt tea gaa ggc agt att gat gcc act aac etc cga 
Asn Arg Met Leu Cys Ser Glu Gly Ser He Asp Ala Thr Asn Leu Arg 
250 255 260 265 

aat ggc aag eta acg ccg gaa gaa tat gac cgt ttg ttt gtg gcc atg 
Asn Gly Lys Leu Thr Pro Glu Glu Tyr Asp Arg Leu Phe Val Ala Met 

270 275 280 



291 



339 



387 



435 



483 



aaa cga aac egg acc ggc ttt get agt att tea gaa ate etc cac caa 531 
Lys Arg Asn Arg Thr Gly Phe Ala Ser He Ser Glu He Leu His Gin 
155 160 165 



579 



627 



675 



723 



gaa act att gcg att ttt tec ctt gag atg ggg get gaa cag ctg gtc 771 
Glu Thr He Ala He Phe Ser Leu Glu Met Gly Ala Glu Gin Leu Val 
235 240 245 



819 



867 



ggg age ttg tct gaa get gat att tat att gat gac act ccc ggc ate 
Gly Ser Leu Ser Glu Ala Asp He tyr He Asp Asp Thr Pro Gly He 



915 
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285 290 295 

egg aca get gaa ate egg gee aag tgc cgc cgc ctg gtc caa gag aag 
Arg Thr Ala Glu lie Arg Ala Lys Cys Arg Arg Leu Val Gin Glu Lys 
300 305 310 



get tea aac tat gaa tec aga cag cag cag gtg tct gat ata tct egg 
Ala Ser Asn Tyr Glu Ser Arg Gin Gin Gin Val Ser Asp He Ser Arg 
330 335 340 345 

cag ctg aag aag ctt tct aag gaa ctt tct gtc cca gtt att gec ctg 
Gin Leu Lys Lys Leu Ser Lys Glu Leu Ser Val Pro Val He Ala Leu 

350 355 360 

tea caa ctg tec egg agt gtg gaa cag aga caa gac aag egg ccc ate 
Ser Gin Leu Ser Arg Ser Val Glu Gin Arg Gin Asp Lys Arg Pro He 

365 370 375 

etc agt gac ttg egg gaa tea ggg teg att gaa cag gat gee gat att 
Leu Ser Asp Leu Arg Glu Ser Gly Ser He Glu Gin Asp Ala Asp He 
380 385 390 

gtg gee ttc ctt tac egg gag gac tac tac caa aat gaa gaa gat ate 
Val Ala Phe Leu Tyr Arg Glu Asp Tyr Tyr Gin Asn Glu Glu Asp He 
395 400 405 

gat gag gac ttt gtc gat aat age gtg gaa gtc att ate gaa aaa aac 
Asp Glu Asp Phe Val Asp Asn Ser Val Glu Val He He Glu Lys Asn 
410 415 420 425 

egg tea gga get cga gga aca gtc aag ttg aac ttt aag aaa gag ttc 
Arg Ser Gly Ala Arg Gly Thr Val Lys Leu Asn Phe Lys Lys Glu Phe 

430 435 440 

aac aaa ttt ace teg att tct tac egg tct gaa gat gaa gtc cca gec 
Asn Lys Phe Thr Ser He Ser Tyr Arg Ser Glu Asp Glu Val Pro Ala 

445 450 455 

aac ttt ggc tag 
Asn Phe Gly 
460 



963 



gga agt ctg ggc ttg att gtc att gac tac ctg caa ttg ate gaa gga 1011 
Gly Ser Leu Gly Leu He Val He Asp Tyr Leu Gin Leu He Glu Gly 
315 320 325 



1059 



1107 



1155 



1203 



1251 



1299 



1347 



1395 



1407 



<210> 16 
<211> 460 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 16 

Met Ala Thr Asn He His Glu Asp Arg Leu Pro Pro Gin Asn He Glu 
15 10 15 
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Ala Glu Gin Ser Val Leu Gly Ser Val Leu Leu Asn Ala Glu Ala Leu 

20 25 30 

Val Ala Ala Met Glu Tyr Val Asp Glu Asp Asp Phe Tyr Arg Arg Ala 
35 40 45 

His Gin Leu lie Phe Lys Ala Met He Asp Leu Tyr Glu Asp Asn Gin 
50 55 60 

Ala He Asp Val He Thr He Lys Asp Lys Leu Glu Ala Asn Asp Gin 
65 70 75 80 

Leu Glu Asp lie Gly Gly Ala Ser Tyr Leu Ala Glu He Ala Gly Val 

85 90 95 



Thr Pro Thr Ala Ala Asn Val Ser Tyr Tyr Ala Lys He Val Glu Asp 

100 105 HO 

Arg Ser Leu Leu Arg Asn Leu He Ala Thr Ala Asn Glu He Ala Gin 
115 120 125 

Ser Gly Tyr Glu Asp His Asp Asp Val Pro Glu Val Leu Asn Asn Ala 
130 ~ 135 - 140 

Glu Gin Lys He Leu Gin Val Ser Glu Lys Arg Asn Arg Thr Gly Phe 
145 " 150 155 160 

Ala Ser He Ser Glu He Leu His Gin Thr He Glu His He Asp Glu 

165 170 175 



Leu His Gin Arg Asp Glu Glu He Thr Gly He Ser Thr Gly Tyr Pro 

180 185 190 

Tyr Leu Asp Arg Met Thr Ser Gly Leu His Glu Asp Glu Leu He He 
195 200 205 



Val Ala Ala Arg Pro Gly Val Gly Lys Thr Ala Phe Ala Leu Asn Val 

210 215 220 

Ala Gin Asn He Gly Thr Ala Thr Asp Glu Thr He Ala He Phe Ser 
225 230 235 240 



Leu Glu Met Gly Ala Glu Gin Leu Val Asn Arg Met Leu Cys Ser Glu 
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245 



250 255 



Gly Ser lie Asp Ala Thr Asn Leu Arg Asn Gly Lys Leu Thr Pro Glu 

260 265 270 

Glu Tyr Asp Arg Leu Phe Val Ala Met Gly Ser Leu Ser Glu Ala Asp 
275 280 285 

lie Tyr lie Asp Asp Thr Pro Gly lie Arg Thr Ala Glu lie Arg Ala 



290 295. 300 



Lys Cys Arg Arg Leu Val Gin Glu Lys Gly Ser Leu Gly Leu He Val 
305 310 315 320 

He Asp Tyr Leu Gin Leu He Glu Gly Ala Ser Asn Tyr Glu Ser Arg 

325 330 335 

Gin Gin Gin Val Ser Asp He Ser Arg Gin Leu Lys Lys Leu Ser Lys 

340 345 350 

Glu Leu Ser Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Ser Val 
355 360 365 

Glu Gin Arg Gin Asp Lys Arg Pro He Leu Ser Asp Leu Arg Glu Ser 
370 375 380 

Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg Glu 
385 390 395 400 

Asp Tyr Tyr Gin Asn Glu Glu Asp He Asp Glu Asp Phe Val Asp Asn 

405 410 415 

Ser Val Glu Val He He Glu Lys Asn Arg Ser Gly Ala Arg Gly Thr 

420 425 430 

Val Lys Leu Asn Phe Lys Lys Glu Phe Asn Lys Phe Thr Ser He Ser 
435 440 445 

Tyr Arg Ser Glu Asp Glu Val Pro Ala Asn Phe Gly 
450 455 460 



<210> 17 
<211> 2484 
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<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (10) . - (2484) 

<223> 



99 



147 



195 



243 



291 



aggaggiL ttg ttt ttg gaa gag aga gat age cgt tta gaa cag att aag 51 
Met Phe Leu Glu Glu Arg Asp Ser Arg Leu Glu Gin He Lys 
1 5 10 

ctg tec aag gag atg aaa aac tea ttc tta gac tat gcc atg agt gtc 
Leu Ser Lys Glu Met Lys Asn Ser Phe Leu Asp Tyr Ala Met Ser Val 
15 20 25 30 

ate gtc tec egg gcc eta ccc gat gtc egg gac ggc ttg aag ccg gtt 
He Val Ser Arg Ala Leu Pro Asp Val Arg Asp Gly Leu Lys Pro Val 

35 40 45 

cac cga aga ate ctg tac gga atg aat gaa ctg ggc tta ace ccg gac 
His Arg Arg He Leu Tyr Gly Met Asn Glu Leu Gly Leu Thr Pro Asp 

50 55 60 

aag tct tat aaa aag tct gcc cgt att gta ggg gat gtt atg ggg aaa 
Lys Ser Tyr Lys Lys Ser Ala Arg He Val Gly Asp Val Met Gly Lys 
65 70 75 

tac cac ccc cac ggt gac act get att tat gac tec atg gtc aga atg 
Tyr His Pro His Gly Asp Thr Ala He Tyr Asp Ser Met Val Arg Met 
80 85 90 

gcc cag gac ttt tct tac cga gtt ccc tta gtg gac ggc cat ggg aac 
Ala Gin Asp Phe Ser Tyr Arg Val Pro Leu Val Asp Gly His Gly Asn 
95 100 105 HO 

ttt ggg teg gtt gac ggg gac gga get get gcc atg egg tat ace gaa 
Phe Gly Ser Val Asp Gly Asp Gly Ala Ala Ala Met Arg Tyr Thr Glu 

115 120 125 

gcc egg atg tec aag atg gcc ttg gaa etc ctg cga gac ate aac aag 
Ala Arg Met Ser Lys Met Ala Leu Glu Leu Leu Arg Asp He Asn Lys 

130 135 140 

gat ace att gac tac cac gat aac tat gat ggg act gag teg gaa ccc 
Asp Thr He Asp Tyr His Asp Asn Tyr Asp Gly Thr Glu Ser Glu Pro 
145 150 155 

gat ate ctt cct gcc cgc ttc ccc aac etc tta gtc aac ggg get teg 531 
Asp He Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser 
160 165 170 

ggg att get gtt ggg atg gca acc aat ate cca ccc cac aat ctt aag 579 
Gly He Ala Val Gly Met Ala Thr Asn He Pro Pro His Asn Leu Lys 
175 180 185 190 



339 



387 



435 



483 
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gaa gtc att gat gcc tgc gtc etc ttg atg gaa aat gag gat gtg act 
Glu Val lie Asp Ala Cys Val Leu Leu Met Glu Asn Glu Asp Val Thr 

195 200 205 

gtg get gac ctt atg gaa gtc tta cca gga cct gac ttt ccg act ggg 
Val Ala Asp Leu Met Glu Val Leu Pro Gly Pro Asp Phe Pro Thr Gly 

210 215 220 



627 



aaa ggt aag gaa aga att att ate gac gaa att cct tac atg gtc aac 
Lys Gly Lys Glu Arg lie lie lie Asp Glu lie Pro Tyr Met Val Asn 
255 260 265 270 

aag gcc aaa ttg gtc gaa aag att gcg gaa ctg get egg gac aag aaa 
Lys Ala Lys Leu Val Glu Lys lie Ala Glu Leu Ala Arg Asp Lys Lys 

275 280 285 

att gac ggc att ace gat tta aat gat gag tct gac egg gaa ggc ttg 
He Asp Gly He Thr Asp Leu Asn Asp Glu Ser Asp Arg Glu Gly Leu 

290 295 300 

egg att gtg ate gat gta cgc egg gat act tct get ggt ata tta ctt 
Arg He Val He Asp Val Arg Arg Asp Thr Ser Ala Gly He Leu Leu 
305 310 315 

aac aag ctt tac aaa atg ace caa ttg cag gtt tct ttt ggc ttt aac 
Asn Lys Leu Tyr Lys Met Thr Gin Leu Gin Val Ser Phe Gly Phe Asn 
320 325 330 

atg ctg get ate gtc gat ggg gtg ccc aaa acc ttg ggc etc aaa gac 
Met Leu Ala He Val Asp Gly Val Pro Lys Thr Leu Gly Leu Lys Asp 
335 340 345 350 

ate ctg acc cac tac tta gac cat caa aaa act gtt ate cgc agg egg 
He Leu Thr His Tyr Leu Asp His Gin Lys Thr Val He Arg Arg Arg 

355 360 365 



ggg ctt egg act gcc tta gac cat ate gat gcc att att acc att ate 
Gly Leu Arg Thr Ala Leu Asp His He Asp Ala He He Thr He He 
385 390 395 



675 



get tec ctt att ggt gtt tct ggc gtc cgc aag get tat gag acc ggt 723 
Ala Ser Leu He Gly Val Ser Gly Val Arg Lys Ala Tyr Glu Thr Gly 
225 230 235 

cgt ggg tec att aaa tta egg gcc aag tec egg ate gat gtc gac caa 771 
Arg Gly Ser He Lys Leu Arg Ala Lys Ser Arg He Asp Val Asp Gin 
240 245 250 



819 



867 



915 



963 



1011 



1059 



1107 



aca gag ttt gac aag aac aag get gaa teg egg gcc cac ate tta gaa 1155 
Thr Glu Phe Asp Lys Asn Lys Ala Glu Ser Arg Ala His He Leu Glu 

370 375 380 



12 03 



cgt cag tec cag caa get gaa gaa gcc aaa agt caa ttg atg get tct 1251 
Arg Gin Ser Gin Gin Ala Glu Glu Ala Lys Ser Gin Leu Met Ala Ser 
400 405 410 



tat gac etc tct gac cgt caa gcc cag gcg att tta gac atg egg atg 



1299 
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ctg tct cga aaa ggc tat ate aaa egg atg ccg get gga gaa ttc aag 
Leu Ser Arg Lys Gly Tyr He Lys Arg Met Pro Ala Gly Glu Phe Lys 

515 52 0 525 



tta aat ata gat aaa gat gaa tat att caa gee atg gtc aac ttg act 
Leu Asn He Asp Lys Asp Glu Tyr He Gin Ala Met Val Asn Leu Thr 

595 600 605 

gac cag gca gat gac cag gac caa ttc ttc ttt gcg aca aga ctt ggc 
Asp Gin Ala Asp Asp Gin Asp Gin Phe Phe Phe Ala Thr Arg Leu Gly 

610 615 620 

egg gtc aaa egg acg gee cag tct gaa ttt caa aat ate aga agt age 
Arg Val Lys Arg Thr Ala Gin Ser Glu Phe Gin Asn He Arg Ser Ser 
625 630 635 



1395 



Tyr Asp Leu Ser Asp Arg Gin Ala Gin Ala He Leu Asp Met Arg Met 
415 420 425 430 

gtc egg ttg act ggt ttg gaa aga gag aaa att gaa gat gaa tac get 1347 
Val Arg Leu Thr Gly Leu Glu Arg Glu Lys He Glu Asp Glu Tyr Ala 

435 440 445 

gaa etc tta gaa aaa ate gag gac ttg cgt gac ate ttg gee egg cca 
Glu Leu Leu Glu Lys He Glu Asp . Leu Arg Asp He Leu Ala Arg Pro 

450 455 460 

gaa egg ate aag caa att ate aaa gaa gaa atg ate gaa att get gaa 
Glu Arg He Lys Gin He He Lys Glu Glu Met He Glu He Ala Glu 
465 470 475 

aaa cac ggc caa gac cgc eta act gac ate egg gtt ggg gaa gag ttg 
Lys His Gly Gin Asp Arg Leu Thr Asp He Arg Val Gly Glu Glu Leu 
480 485 490 



1443 



1491 



age att gaa gac gaa gac ttg att gaa gaa gaa gat ate ate att ace 1539 
Ser He Glu Asp Glu Asp Leu He Glu Glu Glu Asp He He He Thr 
495 " 500 505 510 



1587 



gee caa aac cgc ggt ggc cgt ggg gtt aag ggg atg act acc aac gat 1635 
Ala Gin Asn Arg Gly Gly Arg Gly Val Lys Gly Met Thr Thr Asn Asp 

530 535 540 



1683 



ggg gac ttt gtt gaa cag ctg act ttc tgt tct agt cat gac caa ate 
Gly Asp Phe Val Glu Gin Leu Thr Phe Cys Ser Ser His Asp Gin He 
545 550 555 

etc ttc ttt acc aac caa ggc aag gtt tat aag ate aag gee tac gaa 1731 
Leu Phe Phe Thr Asn Gin Gly Lys Val Tyr Lys He Lys Ala Tyr Glu 
560 565 570 

ate ccg gaa tat ggg cgt aat gee aag gga att cct gec ate aac ttt 1779 
He Pro Glu Tyr Gly Arg Asn Ala Lys Gly He Pro Ala He Asn Phe 
575 "* 580 585 590 



1827 



1875 



1923 



ggg ttg aac gcg ate aat eta aat gaa ggc gat gaa ttg gtt aac gtg 1971 
Gly Leu Asn Ala He Asn Leu Asn Glu Gly Asp Glu Leu Val Asn Val 
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640 645 650 

gtc cct acc cac aat gac cag gcc att ate ctg gec age cag caa ggc 

Val Pro Thr His Asn Asp Gin Ala He He Leu Ala Ser Gin Gin Gly 
655 660 665 670 

tat gcg gtc tac ttt gat gaa aaa gat ate cgt age atg ggt cga ggg 

Tyr Ala Val Tyr Phe Asp Glu Lys Asp He Arg Ser Met Gly Arg Gly 

675 ' 680 685 



cga ggg ggc aag ggg gtt aaa acc ctt cat att acc gat aag aat ggt 
Arg Gly Gly Lys Gly Val Lys Thr Leu His He Thr Asp Lys Asn Gly 
735 " 740 745 750 

ccc eta att gga ctg aaa act gtc tct ggt ggt gag gac gtc atg att 
Pro Leu He Gly Leu Lys Thr Val Ser Gly Gly Glu Asp Val Met He 

755 760 765 

gtc acc gac caa ggt ate atg att cgt ate gaa gcc gac age ate tct 
Val Thr Asp Gin Gly He Met He Arg He Glu Ala Asp Ser He Ser 

770 775 780 

cag acc tec cgc eta acc caa ggt gtc cgt tta att cga ctt gaa gaa 
Gin Thr Ser Arg Leu Thr Gin Gly Val Arg Leu He Arg Leu Glu Glu 
785 790 795 



gac aat caa gtt aac caa acc gtt gag gaa taa 
Asp Asn Gin Val Asn Gin Thr Val Glu Glu 
815 820 



2019 



2067 



get gca ggt gtc cgt gga att cgc tta ggt gat ggc gac aca gtg gtt 2115 
Ala Ala Gly Val Arg Gly He Arg Leu Gly Asp Gly Asp Thr Val Val 

690 695 700 

gcc atg gaa gtc tta gag ccg ggc caa gac gta tta gtc att act gaa 2163 
Ala Met Glu Val Leu Glu Pro Gly Gin Asp Val Leu Val He Thr Glu 
705 710 715 

aaa ggg tac ggc aaa cga acc tec caa gaa gag tac acc etc cac aag 2211 
Lys Gly Tyr Gly Lys Arg Thr Ser Gin Glu Glu Tyr Thr Leu His Lys 
720 ~ ^ 725 730 



2259 



2307 



2355 



2403 



gat age egg gtg tea acg gta gcc etc att gat att gac caa gag ctt 2451 
Asp Ser Arg Val Ser Thr Val Ala Leu He Asp He Asp Gin Glu Leu 
800 805 810 



2484 



<210> 18 
<211> 824 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 18 

Met Phe Leu Glu Glu Arg Asp Ser Arg Leu Glu Gin He Lys Leu Ser 
1 5 10 15 
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Lvs Glu Met Lys Asn Ser Phe Leu Asp Tyr Ala Met Ser Val lie Val 

20 25 30 

Ser Arg Ala Leu Pro Asp Val Arg Asp Gly Leu Lys Pro Val His Arg 
35 40 45 

Arg He Leu Tyr Gly Met Asn Glu Leu Gly Leu Thr Pro Asp Lys Ser 
50 55 60 

Tyr Lys Lys Ser Ala Arg He Val Gly Asp Val Met Gly. Lys Tyr His 
65 70 75 80 

Pro His Gly Asp Thr Ala He Tyr Asp Ser Met Val Arg Met Ala Gin 

85 90 95 

Asp Phe Ser Tyr Arg Val Pro Leu Val Asp Gly His Gly Asn Phe Gly 

100 105 HO 

Ser Val Asp Gly Asp Gly Ala Ala Ala Met Arg Tyr Thr Glu Ala Arg 
115 120 125 

Met Ser Lys Met Ala Leu Glu Leu Leu Arg Asp He Asn Lys Asp Thr 
130 " 135 ' 140 

He Asp Tyr His Asp Asn Tyr Asp Gly Thr Glu Ser Glu Pro Asp He 
145 150 155 160 

Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly He 

165 170 175 

Ala Val Gly Met Ala Thr Asn He Pro Pro His Asn Leu Lys Glu Val 

180 185 190 

He Asp Ala Cys Val Leu Leu Met Glu Asn Glu Asp Val Thr Val Ala 
195 200 205 

Asp Leu Met Glu Val Leu Pro Gly Pro Asp Phe Pro Thr Gly Ala Ser 
210 215 220 

Leu He Gly Val Ser Gly Val Arg Lys Ala Tyr Glu Thr Gly Arg Gly 
225 230 235 240 

Ser He Lys Leu Arg Ala Lys Ser Arg He Asp Val Asp Gin Lys Gly 
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245 250 255 

Lys Glu Arg lie lie He Asp Glu He Pro Tyr Met Val Asn Lys Ala 

260 265 270 

Lys Leu Val Glu Lys He Ala Glu Leu Ala Arg Asp Lys Lys He Asp 
275 280 285 

Gly He Thr Asp Leu Asn Asp Glu Ser Asp Arg Glu Gly Leu Arg He 
290 295 300 

Val He Asp Val Arg Arg Asp Thr Ser Ala Gly He Leu Leu Asn Lys 
305 310 315 320 

Leu Tyr Lys Met Thr Gin Leu Gin Val Ser Phe Gly Phe Asn Met Leu 

325 330 335 

Ala He Val Asp Gly Val Pro Lys Thr Leu Gly Leu Lys Asp He Leu 

340 345 350 

Thr His Tyr Leu Asp His Gin Lys Thr Val He Arg Arg Arg Thr Glu 
- 355 360 365 

Phe Asp Lys Asn Lys Ala Glu Ser Arg Ala His He Leu Glu Gly Leu 
370 375 380 

Arg Thr Ala Leu Asp His He Asp Ala He He Thr He He Arg Gin 
385 390 395 400 

Ser Gin Gin Ala Glu Glu Ala Lys Ser Gin Leu Met Ala Ser Tyr Asp 

405 410 415 

Leu Ser Asp Arg Gin Ala Gin Ala He Leu Asp Met Arg Met Val Arg 

420 425 430 

Leu Thr Gly Leu Glu Arg Glu Lys He Glu Asp Glu Tyr Ala Glu Leu 
435 440 445 

Leu Glu Lys He Glu Asp Leu Arg Asp He Leu Ala Arg Pro Glu Arg 
450 ** 455 460 

He Lys Gin He He Lys Glu Glu Met He Glu He Ala Glu Lys His 
465 470 475 480 
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Gly Gin Asp Arg Leu Thr Asp lie Arg Val Gly Glu Glu Leu Ser He 

485 490 495 

Glu Asp Glu Asp Leu He Glu Glu Glu Asp He He He Thx Leu Ser 

500 505 510 

Arg Lys Gly Tyr He Lys Arg Met Pro Ala Gly Glu Phe Lys Ala Gin 
515 520 525 

Asn Arg Gly Gly Arg Gly Val Lys Gly Met Thr Thr Asn Asp Gly Asp 
530 535 540 

Phe Val Glu Gin Leu Thr Phe Cys Ser Ser His Asp Gin He Leu Phe 
545 550 555 560 

Phe Thr Asn Gin Gly Lys Val Tyr Lys He Lys Ala Tyr Glu He Pro 

565 570 575 

Glu Tyr Gly Arg Asn Ala Lys Gly He Pro Ala He Asn Phe Leu Asn 

580 585 590 

He Asp Lys Asp Glu Tyr He Gin Ala Met Val Asn Leu Thr Asp Gin 
595 600 605 

Ala Asp Asp Gin Asp Gin Phe Phe Phe Ala Thr Arg Leu Gly Arg Val 
610 615 620 

Lys Arg Thr Ala Gin Ser Glu Phe Gin Asn He Arg Ser Ser Gly Leu 
625 630 635. 640 

Asn Ala He Asn Leu Asn Glu Gly Asp Glu Leu Val Asn Val Val Pro 

645 650 655 

Thr His Asn Asp Gin Ala He He Leu Ala Ser Gin Gin Gly Tyr Ala 

660 665 670 

Val Tyr Phe Asp Glu Lys Asp He Arg Ser Met Gly Arg Gly Ala Ala 
675 680 685 

Gly Val Arg Gly He Arg Leu Gly Asp Gly Asp Thr Val Val Ala Met 
690 695 700 
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Glu Val Leu Glu Pro Gly Gin Asp Val Leu Val He Thr Glu Lys Gly 
705 



710 715 720 



Tyr Gly Lys Arg Thr Ser Gin Glu Glu Tyr Thr Leu His Lys Arg Gly 

725 730 

Gly Lys Gly Val Lys Thr Leu His He Thr Asp Lys Asn Gly Pro Leu 

740 745 750 

He Gly Leu Lys Thr Val Ser Gly Gly Glu Asp Val Met He Val Thr 
755 760 765 

Asp Gin Gly He Met He Arg He Glu Ala Asp Ser He Ser Gin Thr 
770 775 780 

Ser Arg Leu Thr Gin Gly Val Arg Leu He Arg Leu Glu Glu Asp Ser 
785 790 795 800 

Arg Val Ser Thr Val Ala Leu He Asp He Asp Gin Glu Leu Asp Asn 

805 810 815 



Gin Val Asn Gin Thr Val Glu Glu 

820 



<210> 19 
<211> 1956 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . - (1956) 

<223> 

cgtgta 1 atg get gaa gat aga cca tta aca cca aat gag tta gca gaa 
Met Ala Glu Asp Arg Pro Leu Thr Pro Asn Glu Leu Ala Glu 
1 5 10 

ctg aaa aaa aca tat gat get agt caa ate caa gtc tta gaa ggc eta 
Leu Lys Lys Thr Tyr Asp Ala Ser Gin He Gin Val Leu Glu Gly Leu 
15 20 25 30 

gaa gca gtc aga gta egg ccg ggt atg tac att ggg tec ace age aag 
Glu Ala Val Arg Val Arg Pro Gly Met Tyr He Gly Ser Thr Ser Lys 

35 40 45 

gaa ggc etc cac cac ttg gta tgg gag ate gtg gac aat get att gac 



48 



96 



144 



192 
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Glu Gly Leu His His Leu Val Trp Glu lie Val Asp Asn Ala lie Asp 

50 55 60 

gaa get atg gec ggt tat gec gac aag att tct gtt tec att ttg gaa 
Glu Ala Met Ala Gly Tyr Ala Asp Lys lie Ser Val Ser He Leu Glu 
65 70 75 

ggc gac gtg ate caa gtg get gat aac ggc egg ggc ate ccg gtt gat 
Gly Asp Val He Gin Val Ala Asp Asn Gly Arg Gly He Pro Val Asp 
80 85 90 

ate cag gaa aaa aca ggc egg cca get gtt gaa act gtc ttt aca gtc 
He Gin Glu Lys Thr Gly Arg Pro Ala Val Glu Thr Val Phe Thr Val 
95 100 105 HO 

etc cac get ggt ggg aaa ttt ggt ggc ggt ggt tac aag gtt tec ggt 
Leu His Ala Gly Gly Lys Phe Gly Gly Gly Gly Tyr Lys Val Ser Gly 

115 120 125 

ggt ctg cac ggt gta ggg tct tct gtg gtc aat get etc tec gaa tac 
Gly Leu His Gly Val Gly Ser Ser Val Val Asn Ala Leu Ser Glu Tyr 

130 135 140 

etc caa gtc cag gtg cac cga gat ggt aaa ate tac caa caa gtt tac 
Leu Gin Val Gin Val His Arg Asp Gly Lys He Tyr Gin Gin Val Tyr 
145 150 155 

aag egg ggc ttg gtt gat tct gac ttg gaa gtg gtg ggt gag act gac 
Lys Arg Gly Leu Val Asp Ser Asp Leu Glu Val Val Gly Glu Thr Asp 
160 165 170 

cac act gga act att gtt ace ttt aag gca gat agt ttg att ttt aaa 
His Thr Gly Thr He Val Thr Phe Lys Ala Asp Ser Leu He Phe Lys 
175 180 185 190 

gac act act tct tat gac ttc aat acc tta gec acc egg ate egg gag 
Asp Thr Thr Ser Tyr Asp Phe Asn Thr Leu Ala Thr Arg He Arg Glu 

195 200 205 

ttg gee ttc tta aac cga ggc ttg aat att tec ate gaa gac aaa egg 
Leu Ala Phe Leu Asn Arg Gly Leu Asn He Ser He Glu Asp Lys Arg 

210 215 220 

caa gca ggc ggg cag tct ttg aac tac cac tat gaa ggt ggg ata teg 
Gin Ala Gly Gly Gin Ser Leu Asn Tyr His Tyr Glu Gly Gly He Ser 
225 230 235 

agt tat gtt gac cac ttg aat tec age cgt gaa gtt ctt tat gag acc 
Ser Tyr Val Asp His Leu Asn Ser Ser Arg Glu Val Leu Tyr Glu Thr 
240 245 250 

cca att ttc ttg gaa ggg gaa gaa gaa ggg att tct gtg gaa att gec 
Pro He Phe Leu Glu Gly Glu Glu Glu Gly He Ser Val Glu He Ala 
255 260 265 270 

etc cag cat acc gat age ttc cat act aat tta atg agt ttt gee aat 
Leu Gin His Thr Asp Ser Phe His Thr Asn Leu Met Ser Phe Ala Asn 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 
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275 280 285 

aac ate cac acc tat gag ggt ggc atg cat att tec ggc ttc aag aca 
Asn He His Thr Tyr Glu Gly Gly Met His He Ser Gly Phe Lys Thr 

290 295 300 

gec ctt acc egg gcg gtc aac gac tat gec egg cag aat aac ttg etc 
Ala Leu Thr Arg Ala Val Asn Asp Tyr Ala Arg Gin Asn Asn Leu Leu 
305 310 315 

cga gag tea gag gat aac ttt acc ggc gat gac gtt egg gaa ggt ctg 
Arg Glu Ser Glu Asp Asn Phe Thr Gly Asp Asp Val Arg Glu Gly Leu 
320 325 330 

acg gtg gtt ttg tea ate aag cac cca gac ccc caa ttt gaa gga caa 
Thr Val Val Leu Ser lie Lys His Pro Asp Pro Gin Phe Glu Gly Gin 
335 340 345 350 

acc aag act aag ctg ggg aac tct gaa gtc aga ggg ata att gac egg 
Thr Lys Thr Lys Leu Gly Asn Ser Glu Val Arg Gly He He Asp Arg 

355 360 365 

etc ttt age cag cac ttt gaa cgt tac etc atg gaa aat cca aag gtt 
Leu Phe Ser Gin His Phe Glu Arg Tyr Leu Met Glu Asn Pro Lys Val 

370 375 380 

ggt aag egg att gtt gac aag gcg ctt ttg get tec aaa gec cgc caa 
Gly Lys Arg He Val Asp Lys Ala Leu Leu Ala Ser Lys Ala Arg Gin 
385 390 395 

gca gec aag aga gec egg gaa gtc acc egg aag aaa tea ggc tta gaa 
Ala Ala Lys Arg Ala Arg Glu Val Thr Arg Lys Lys Ser Gly Leu Glu 
400 405 410 

att age aac eta cca ggt aaa tta get gac tgt tct tec aaa gat cct 
He Ser Asn Leu Pro Gly Lys Leu Ala Asp Cys Ser Ser Lys Asp Pro 
415 420 425 430 

gaa gaa tec gaa etc ttt att gta gaa ggg gat tea get gga ggg teg 
Glu Glu Ser Glu Leu Phe He Val Glu Gly Asp Ser Ala Gly Gly Ser 

435 440 445 

get aag caa ggt egg tec egg gtt ttc cag get att ttg ccg att cgt 
Ala Lys Gin Gly Arg Ser Arg Val Phe Gin Ala He Leu Pro He Arg 

450 455 460 

ggt aag att ttg aat gtc gaa aaa gee age att gac cgt ate tta gec 
Gly Lys He Leu Asn Val Glu Lys Ala Ser He Asp Arg He Leu Ala 
465 470 475 

aat gaa gaa ate egg tct etc ttt aca gee atg gga act ggc ttc ggg 
Asn Glu Glu He Arg Ser Leu Phe Thr Ala Met Gly Thr Gly Phe Gly 
480 485 490 

gaa gaa ttt aat gtt gaa gaa get cgc tac aat aag tta att ate atg 
Glu Glu Phe Asn Val Glu Glu Ala Arg Tyr Asn Lys Leu He He Met 
ASS 500 505 510 
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960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 
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aca gat get gat gtt gac gga gcc cac att egg ace ttg etc ttg acc 
Thr Asp Ala Asp Val Asp Gly Ala His He Arg Thr Leu Leu Leu Thr 

515 520 525 

ctt ctt tac egg tat atg egg ccc ttg att gaa gca ggt ttc gtc tac 
Leu Leu Tyr Arg Tyr Met Arg Pro Leu He Glu Ala Gly Phe Val Tyr 

530 535 540 

att gcc cag cca ccc etc tac cag gtc aag caa ggc aag aag gtt aaa 
He Ala Gin Pro Pro Leu Tyr Gin Val Lys Gin Gly Lys Lys Val Lys 
545 550 555 

tac ttt gat agt gac egg gaa ctg gac tec tac ttg aaa gaa att cct 
Tyr Phe Asp Ser Asp Arg Glu Leu Asp Ser Tyr Leu Lys Glu He Pro 
560 565 570 

gac tea ccc aag cct tct gtc caa cgc tac aaa ggc tta gga gaa atg 
Asp Ser Pro Lys Pro Ser Val Gin Arg Tyr Lys Gly Leu Gly Glu Met 
575 580 585 590 

gat get gag cag ttg tgg gaa acc acc atg aac cca gaa cac cgc cgc 
Asp Ala Glu Gin Leu Trp Glu Thr Thr Met Asn Pro Glu His Arg Arg 

595 600 605 

tta ctt egg gta gac gta gac gac gcc att gag get gac act att ttt 
Leu Leu Arg Val Asp Val Asp Asp Ala He Glu Ala Asp Thr He Phe 

610 615 620 

gac atg ttg atg ggt gag gat gtc aaa ccc egg cgc gac ttt ate aaa 
Asp Met Leu Met Gly Glu Asp Val Lys Pro Arg Arg Asp Phe He Lys 
625 630 635 

gaa aat gcc cgt tac gtg gaa aat ate gat ate tag 
Glu Asn Ala Arg Tyr Val Glu Asn He Asp He 
640 645 



1584 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1956 



<210> 20 
<211> 649 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 20 

Met Ala Glu Asp Arg Pro Leu Thr Pro Asn Glu Leu Ala Glu Leu Lys 
15 10 15 

Lys Thr Tyr Asp Ala Ser Gin He Gin Val Leu Glu Gly Leu Glu Ala 

20 25 30 

Val Arg Val Arg Pro Gly Met Tyr He Gly Ser Thr Ser Lys Glu Gly 
35 40 45 



Leu His His Leu Val Trp Glu He Val Asp Asn Ala He Asp Glu Ala 
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50 



55 60 



Met Ala Gly Tyr Ala Asp Lys He Ser Val Ser He Leu Glu Gly Asp 
65 70 75 80 

Val He Gin Val Ala Asp Asn Gly Arg Gly He Pro Val Asp He Gin 

85 90 95 

Glu Lys Thr Gly Arg Pro Ala Val Glu Thr Val Phe Thr Val Leu His 

100 105 HO 

Ala Gly Gly Lys Phe Gly Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu 
115 120 125 

His Gly Val Gly Ser Ser Val Val Asn Ala Leu Ser Glu Tyr Leu Gin 
130 135 140 

Val Gin Val His Arg Asp Gly Lys He Tyr Gin Gin Val Tyr Lys Arg 
145 150 155 160 

Gly Leu Val Asp Ser Asp Leu Glu Val Val Gly Glu Thr Asp His Thr 

165 170 175 

Glv Thr He Val Thr Phe Lys Ala Asp Ser Leu He Phe Lys Asp Thr 

180 185 190 

Thr Ser Tyr Asp Phe Asn Thr Leu Ala Thr Arg He Arg Glu Leu Ala 
195 200 205 

Phe Leu Asn Arg Gly Leu Asn He Ser He Glu Asp Lys Arg Gin Ala 
210 215 220 

Gly Gly Gin Ser Leu Asn Tyr His Tyr Glu Gly Gly He Ser Ser Tyr 
225 230 235 240 

Val Asp His Leu Asn Ser Ser Arg Glu Val Leu Tyr Glu Thr Pro He 

245 250 255 

Phe Leu Glu Gly Glu Glu Glu Gly He Ser Val Glu He Ala Leu Gin 

260 265 270 

His Thr Asp Ser Phe His Thr Asn Leu Met Ser Phe Ala Asn Asn He 
275 280 285 
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His Thr Tyr Glu Gly Gly Met His He Ser Gly Phe Lys Thr Ala Leu 
290 295 300 

Thr Arg Ala Val Asn Asp Tyr Ala Arg Gin Asn Asn Leu Leu Arg Glu 
305 310 315 3ZO 

Ser Glu Asp Asn Phe Thr Gly Asp Asp Val Arg Glu Gly Leu Thr Val 

325 330 . 335 

Val Leu Ser He Lys His Pro Asp Pro Gin Phe Glu Gly Gin Thr Lys 

340 345 350 

Thr Lys Leu Gly Asn Ser Glu Val Arg Gly He He Asp Arg Leu Phe 
355 360 365 

Ser Gin His Phe Glu Arg Tyr Leu Met Glu Asn Pro Lys Val Gly Lys 
370 . 375 380 

Arg He Val Asp Lys Ala Leu Leu Ala Ser Lys Ala Arg Gin Ala Ala 
385 390 395 

Lys Arg Ala Arg Glu Val Thr Arg Lys Lys Ser Gly Leu Glu lie Ser 

405 410 415 

Asn Leu Pro Gly Lys Leu Ala Asp Cys Ser Ser Lys Asp Pro Glu Glu 

420 425 430 

Ser Glu Leu Phe He Val Glu Gly Asp Ser Ala Gly Gly Ser Ala Lys 
435 440 445 

Gin Gly Arg Ser Arg Val Phe Gin Ala He Leu Pro He Arg Gly Lys 
450 455 460 

He Leu Asn Val Glu Lys Ala Ser He Asp Arg He Leu Ala Asn Glu 
465 470 475 480 

Glu He Arg Ser Leu Phe Thr Ala Met Gly Thr Gly Phe Gly Glu Glu 

485 490 495 

Phe Asn Val Glu Glu Ala Arg Tyr Asn Lys Leu He He Met Thr Asp 

500 505 510 
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Ala Asp Val Asp Gly Ala His lie Arg Thr Leu Leu Leu Thr Leu Leu 
515 520 525 

Tyr Arg Tyr Met Arg Pro Leu He Glu Ala Gly Phe Val Tyr He Ala 
530 ~ 535 540 



Gin Pro Pro Leu Tyr Gin Val Lys Gin Gly Lys Lys Val Lys Tyr Phe 
545 



550 555 560 



Asp Ser Asp Arg Glu Leu Asp Ser Tyr Leu Lys Glu He Pro Asp Ser 

565 570 575 

Pro Lys Pro Ser Val Gin Arg Tyr Lys Gly Leu Gly Glu Met Asp Ala 

580 . 585 590 

Glu Gin Leu Trp Glu Thr Thr Met Asn Pro Glu His Arg Arg Leu Leu 
595 600 605 

Arg Val Asp Val Asp Asp Ala He Glu Ala Asp Thr He Phe Asp Met 
610 615 620 

Leu Met Gly Glu Asp Val Lys Pro Arg Arg Asp Phe He Lys Glu Asn 
625 630 635 640 

Ala Arg Tyr Val Glu Asn He Asp lie 

645 



<210> 21 
<211> 1218 
<212> DNA • 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1218) 

<223> 

<400> 21 . . ^ 

agacctaatc atttt ttg aaa tgg aga aag aca aaa acc ate tat ggt ata 

Met Lys Trp Arg Lys Thr Lys Thr He Tyr Gly lie 



1 => 10 



ctt aag aac aaa agg aag ttt gga ggg att ttt ttg aaa ttt tea gta 
Leu Lys Asn Lys Arg Lys Phe Gly Gly He Phe Leu Lys Phe Ser Val 
15 .20 25 

aaa egg acg gaa ttt eta aaa gta tta aaa aaa gta cag att gca gtg 



99 



147 
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Lys Arg Thr Glu Phe Leu Lys Val Leu Lys Lys Val Gin He Ala Val 
30 35 40 

tct tct aaa agt acc ate get ate ttg ace ggg att aaa tta gaa gcg 
Ser Ser Lys Ser Thr He Ala He Leu Thr Gly He Lys Leu Glu Ala 
45 50 55 60 

gat aac cag ggt tta acc tta acegga tct aac teg gat ate tea gtt 
Asp Asn Gin Gly Leu Thr Leu Thr Gly Ser Asn Ser Asp He Ser Val 

65 70 75 

gaa agt tac tta tct gtg acc gat gaa ggg gcg gat ttg gtt att gat 
Glu Ser Tyr Leu Ser Val Thr Asp Glu Gly Ala Asp Leu Val He Asp 

80 85 90 

gag ccg ggg cag att gtc ttg caa cca gee egg tta ttt gee aat ate 
Glu Pro Gly Gin He Val Leu Gin Pro Ala Arg Leu Phe Ala Asn He 
95 100 105 

gtc caa aaa tta ccg gac acc cac ttt aag gta aac gtt age caa ggc 
Val Gin Lys Leu Pro Asp Thr His Phe Lys Val Asn Val Ser Gin Gly 
110 115 120 

cag caa acc caa ate acc tea get tea gee tec ttt act ate aac ggc 
Gin Gin Thr Gin He Thr Ser Ala Ser Ala Ser Phe Thr He Asn Gly 
--- 130 135 140 



125 



att gac gee atg tec tac ccc cac ttg cca gat ate gac ctg gag gaa 
He Asp Ala Met Ser Tyr Pro His Leu Pro Asp He Asp Leu Glu Glu 

145 150 155 

tec ttt acc ctg ccg gtt gac etc ttt aaa aac atg ate aac cag act 
Ser Phe Thr Leu Pro Val Asp Leu Phe Lys Asn Met He Asn Gin Thr 

160 165 170 

gtc ate gca gtc tec aac cat gaa agt egg ccc ate eta act ggg gtt 
Val He Ala Val Ser Asn His Glu Ser Arg Pro He Leu Thr Gly Val 
175 180 185 

aac eta tct etc aaa gag ggc cga etc aag gca gtg gca acc gac age 
Asn Leu Ser Leu Lys Glu Gly Arg Leu Lys Ala Val Ala Thr Asp Ser 
190 195 200 

cac cgt ttg teg caa egg tec ate caa tta gag tea gcg cct gat att 
His Arg Leu Ser Gin Arg Ser He Gin Leu Glu Ser Ala Pro Asp lie 
205 210 215 220 

tec ttt gac att gtg ata cca ggt aag tct ttg act gaa ctg act aag 
Ser Phe Asp He Val He Pro Gly Lys Ser Leu Thr Glu Leu Thr Lys 

225 230 235 

ttg atg gat gca gat gaa gaa gtc egg gta gee ate age gac aac caa 
Leu Met Asp Ala Asp Glu Glu Val Arg Val Ala He Ser Asp Asn Gin 

240 245 250 

ate eta ttt gee etc tec age age cag ttt tac tct egg etc eta gaa 
He Leu Phe Ala Leu Ser Ser Ser Gin Phe Tyr Ser Arg Leu Leu Glu 



195 



243 



291 



339 



387 



435 



483 



531 



579 
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255 260 265 

ggt aag tat cct gat acc gac cgc ttg ate cca ggc gac acc cca acg 

Gly Lys Tyr Pro Asp Thr Asp Arg Leu He Pro Gly Asp Thr Pro Thr 
270 275 280 



tec etc etc tec cat gaa ggg aaa aac aat gtg gtc caa etc aca gtg 
Ser Leu Leu Ser His Glu Gly Lys Asn Asn Val Val Gin Leu Thr Val 

305 310 315 



gtc caa gaa gaa att gac ttt ggc cac ttc caa ggc caa gac tta acc 
Val Gin Glu Glu He Asp Phe Gly His Phe Gin Gly Gin Asp Leu Thr 
335 340 345 

att tct ttc aac ccc gac tac tta aaa gag gee ttg get acc ttt ggt 
lie Ser Phe Asn Pro Asp Tyr Leu Lys Glu Ala Leu Ala Thr Phe Gly 
350 355 360 



ate gtc cca agt gag gac caa gga gac ttt ate caa ctt att act cca 
He Val Pro Ser Glu Asp Gin Gly Asp Phe He Gin Leu He Thr Pro 

385 390 395 

ate cga aca gee taa 
He Arg Thr Ala 

400 



867 



gaa ate acc ttg gac acc aag gaa tta cag ggg get gtt gac egg get 915 
Glu He Thr Leu Asp Thr Lys Glu Leu Gin Gly Ala Val Asp Arg Ala 
285 290 295 300 



963 



act get gaa aag ttg gaa ate gaa ggc cag tea get gaa gtg ggc cat 1011 
Thr Ala Glu Lys Leu Glu He Glu Gly Gin Ser Ala Glu Val Gly His 

320 325 330 



1059 



1107 



caa gga gaa att aag ttg aaa tta gtt teg acc ttg cga ccc ttt gtc 1155 
Gin Gly Glu He Lys Leu Lys Leu Val Ser Thr Leu Arg Pro Phe Val 
365 370 375 380 



1203 



1218 



<210> 22 
<211> 400 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 22 

Met Lys Trp Arg Lys Thr Lys Thr He Tyr Gly He Leu Lys Asn Lys 
15. 10 15 



Arg Lys Phe Gly Gly He Phe Leu Lys Phe Ser Val Lys Arg Thr Glu 

20 25 30 



Phe Leu Lys Val Leu Lys Lys Val Gin He Ala Val Ser Ser Lys Ser 
35 40 45 
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Thr He Ala lie Leu Thr Gly He" Lys Leu Glu Ala Asp Asn Gin Gly 
50 55 60 

Leu Thr Leu Thr Gly Ser Asn Ser Asp He Ser Val Glu Ser Tyr Leu 
65 70 75 80 

Ser Val Thr Asp Glu Gly Ala Asp Leu Val He Asp Glu Pro Gly Gin 

85 90 95 

He Val Leu Gin Pro Ala Arg Leu Phe Ala Asn He Val Gin Lys Leu 

100 105 HO 

Pro Asp Thr His Phe Lys Val Asn Val Ser Gin Gly Gin Gin Thr Gin 
115 120 125 

He Thr Ser Ala Ser Ala Ser Phe Thr He Asn Gly He Asp Ala Met 
130 135 140 

Ser Tyr Pro His Leu Pro Asp He Asp Leu Glu Glu Ser Phe Thr Leu 
145 150 155 160 

Pro Val Asp Leu Phe Lys Asn Met He Asn Gin Thr Val He Ala Val 

165 170 175 

Ser Asn His Glu Ser Arg Pro He Leu Thr Gly Val Asn Leu Ser Leu 

180 185 190 

Lys Glu Gly Arg Leu Lys Ala Val Ala Thr Asp Ser His Arg Leu Ser 
195 200 205 

Gin Arg Ser He Gin Leu Glu Ser Ala Pro Asp He Ser Phe Asp He 
210 215 220 

Val He Pro Gly Lys Ser Leu Thr Glu Leu Thr Lys Leu Met Asp Ala 
225 230 235 240 

Asp Glu Glu Val Arg Val Ala He Ser Asp Asn Gin He Leu Phe Ala 

245 250 255 

Leu Ser Ser Ser Gin Phe Tyr Ser Arg Leu Leu Glu Gly Lys Tyr Pro 

260 265 270 

Asp Thr Asp Arg Leu He Pro Gly Asp Thr Pro Thr Glu He Thr .Leu 



WO 03/104391 



60/235 



PCT7US02/36122 



275 280 285 

Asp Thr Lys Glu Leu Gin Gly Ala Val Asp Arg Ala Ser Leu Leu Ser 
290 295 300 

His Glu Gly Lys Asn Asn Val Val Gin Leu Thr Val Thr Ala Glu Lys 
305 310 315 320 

Leu Glu lie Glu Gly Gin Ser Ala Glu Val Gly His Val Gin Glu Glu 

325 330 335 

He Asp Phe Gly His Phe Gin Gly Gin Asp Leu Thr He Ser Phe Asn 

340 345 350 

Pro Asp Tyr Leu Lys Glu Ala Leu Ala Thr Phe Gly Gin Gly Glu He 
355 360 365 

Lys Leu Lys Leu Val Ser Thr Leu Arg Pro Phe Val He Val Pro Ser 
370 375 380 

Glu Asp Gin Gly Asp Phe He Gin Leu He Thr Pro He Arg Thr Ala 
385 390 395 400 



<210> 23 
<211> 1317 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (1317) 

<223> 

<400> 23 

tcaataactg cttttttagg agct ttg cag atg aat tgg aaa gaa acc ate 

Met Gin Met Asn Trp Lys Glu Thr He 
1 5 

agt etc ate aac acc acc egg ggg acc gga gac aag aaa aat ttg aac 
Ser Leu He Asn Thr Thr Arg Gly Thr Gly Asp Lys Lys Asn Leu Asn 
10 15 20 25 

egg atg cga ctt tta etc aaa gag eta ggt aat cct gaa aca gac ttg 
Arg Met Arg Leu Leu Leu Lys Glu Leu Gly Asn Pro Glu Thr Asp Leu 

30 35 40 



51 



99 



147 
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ccg gtc ate cac gtt get ggc acc aat ggc aaa ggg acg ace tgt get 195 
Pro Val lie His Val Ala Gly Thr Asn Gly Lys Gly Thr Thr Cys Ala 

45 50 55 



tat att gee cac age ttg gee cgt get ggt tat aaa aca gga ctt tac 
Tyr lie Ala His Ser Leu Ala Arg Ala Gly Tyr Lys Thr Gly Leu Tyr 
60 65 70 

acc age ccc cac ctg gag egg gtc aat gaa egg ate egg att aat gac 
Thr Ser Pro His Leu Glu Arg Val Asn Glu Arg lie Arg lie Asn Asp 
75 80 85 

cgc tac ata tec gac caa gac tta atg get ttg acc ggt caa att gee 
Arg Tyr lie Ser Asp Gin Asp Leu Met Ala Leu Thr Gly Gin lie Ala 
90 95 100 105 

ccc ate att gac cat eta gaa gac tgc ttg ggt gag aaa tac tat tct 
Pro lie lie Asp His Leu Glu Asp Cys Leu Gly Glu Lys Tyr Tyr Ser 

110 115 I 20 

ttt gaa att tta act gee ctt gee ttc ttg tac ttc cag caa gca ggg 
Phe Glu lie Leu Thr Ala Leu Ala Phe Leu Tyr Phe Gin Gin Ala Gly 

125 130 I 35 

gtg gac ttt tta gtt tta gaa act ggg gta ggg gga aaa att gat gcg 
Val Asp Phe Leu Val Leu Glu Thr Gly Val Gly Gly Lys He Asp Ala 
140 1^5 150 

acc aat gtg gtg ccc get cca ctg gtc tea gtc att ate tct att ggc 
Thr Asn Val Val Pro Ala Pro Leu Val Ser Val He He Ser He Gly 
155 160 165 

tat gac cac acc cat gtc ttg ggt aat acc ctg gaa gac att acc egg 
Tyr Asp His Thr His Val Leu Gly Asn Thr Leu Glu Asp He Thr Arg 
170 175 180 185 

cac aag gca ggg att att aag aaa ggc tgt ccg gtg gtg gtg ggc cct 
His Lys Ala Gly lie He Lys Lys Gly Cys Pro Val Val Val Gly Pro 

190 195 200 

ctt gec gac cat tta ttg get att gtt aaa gag gtg tec aaa gaa atg 
Leu Ala Asp His Leu Leu Ala He Val Lys Glu Val Ser Lys Glu Met 

205 210 215 

gac agt aat tta acc att gtc cat ccc gac aag ttt gac att gtt cat 
Asp Ser Asn Leu Thr He Val His Pro Asp Lys Phe Asp He Val Has 
220 225 230 

caa acc ctt gac tac cag tec ttt aaa tac ggt ggg gac ttg gtt tta 
Gin Thr Leu Asp Tyr Gin Ser Phe Lys Tyr Gly Gly Asp Leu Val Leu 
235 240 245 

gag act caa atg att ggt aac cac cag ctg gta aac act gec eta get 
Glu Thr Gin Met He Gly Asn His Gin Leu Val Asn Thr Ala Leu Ala 
250 255 260 265 

tat gaa gee ttg aag att gtc caa caa tct tac ccc gat ttg aca gat 



243 



291 



339 



387 



435 



483 



531 



579 



627 



675 



723 



771 



819 



867 
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tct ttg act gac aag ggc tac cag get tec agt gtg get age etc caa 
Ser Leu Thr Asp Lys Gly Tyr Gin Ala Ser Ser Val Ala Ser Leu Gin 
380 " 385 390 

gee ate tta gac tac ata aac cag caa gca aaa gca gat gaa att ate 
Ala lie Leu Asp Tyr lie Asn Gin Gin Ala Lys Ala Asp Glu lie He 
395 400 405 

att ate ttt ggc tec etc tac ttg gtt ggc gac ttc eta aaa ctt tac 
He He Phe Gly Ser Leu Tyr Leu Val Gly Asp Phe Leu Lys Leu Tyr 
410 415 420 425 

cat gaa gca tec ggt taa 
His Glu Ala Ser Gly 

430 



915 



963 



1011 



Tyr Glu Ala Leu Lys He Val Gin Gin Ser Tyr Pro Asp Leu Thr Asp 

270 275 280 

tta gat ata tta gaa ggc ttg aag acg acc cac tgg cca ggc egg atg 
Leu Asp He Leu Glu Gly Leu Lys Thr Thr His Trp Pro Gly Arg Met 

285 290 295 

caa aag eta tct gac cag cca gtg gtt gtt ctt gat ggg gec cac aac 
Gin Lys Leu Ser Asp Gin Pro Val Val Val Leu Asp Gly Ala His Asn 
300 305 310 

gaa ate ggg gtc aag get ctt aga cag tea att gac cac ttt ttc ccc 
Glu He Gly Val Lys Ala Leu Arg Gin Ser He Asp His Phe Phe Pro 
315 320 325 

ggc aaa aaa ate acc tat ttt gee gga atg atg gtc gaa aaa gac ttc 
Gly Lys Lys He Thr Tyr Phe Ala Gly Met Met Val Glu Lys Asp Phe 
330 335 340 345 

gee aaa atg ttt gac etc ctg ggg gaa aca get gat aaa ttt tac ttg 
Ala Lys Met Phe Asp Leu Leu Gly Glu Thr Ala Asp Lys Phe Tyr Leu 

350 355 360 

att tea ccc gat ttg act cgc ggt ttt gat gtc gac caa gee gtt caa 1155 
He Ser Pro Asp Leu Thr Arg Gly Phe Asp Val Asp Gin Ala Val Gin 

365 370 375 



1059 



1107 



1203 



1251 



1299 



1317 



<210> 24 
<211> 430 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 24 

Met Gin Met Asn Trp Lys Glu Thr He Ser Leu He Asn Thr Thr Arg 
1 5 10 15 



Gly Thr Gly Asp Lys Lys Asn Leu Asn Arg Met Arg Leu Leu Leu Lys 

20 25 30 
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Glu Leu Gly Asn Pro Glu Thr Asp Leu Pro Val He His Val Ala Gly 
35 40 45 

Thr Asn Gly Lys Gly Thr Thr Cys Ala Tyr He Ala His Ser Leu Ala 
50 55 60 

Arg Ala Gly Tyr Lys Thr Gly Leu Tyr Thr Ser Pro His Leu Glu Arg 
65 70 75 80 

Val Asn Glu Arg He Arg He Asn Asp Arg Tyr He Ser Asp Gin Asp 

85 90 95 



Leu Met Ala Leu Thr Gly Gin He Ala Pro He He Asp His Leu Glu 

100 105 HO 



Asp Cys Leu Gly Glu Lys Tyr Tyr Ser Phe Glu He Leu Thr Ala Leu 
115 120 125 

Ala Phe Leu Tyr Phe Gin Gin Ala Gly Val Asp Phe Leu Val Leu Glu 
13 0 135 140 

Thr Gly Val Gly Gly Lys He Asp Ala Thr Asn Val Val Pro Ala Pro 
145 150 155 160 

Leu Val Ser Val He lie Ser He Gly Tyr Asp His Thr His Val Leu 

165 170 175 



Gly Asn Thr Leu Glu Asp He Thr Arg His Lys Ala Gly He He Lys 

180 185 190 

Lys Gly Cys Pro Val Val Val Gly Pro Leu Ala Asp His Leu Leu Ala 
195 200 205 

lie Val Lys Glu Val Ser Lys Glu Met Asp Ser Asn Leu Thr He Val 
210 " 215 - 220 



His Pro Asp Lys Phe Asp He Val His Gin Thr Leu Asp Tyr Gin Ser 
225 230 235 240 



Phe Lys Tyr Gly Gly Asp Leu Val Leu Glu Thr Gin Met He Gly Asn 

245 250 255 
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His Gin Leu Val Asn Thr Ala Leu Ala Tyr Glu Ala Leu Lys lie Val 

260 265 270 

Gin Gin Ser Tyr Pro Asp Leu Thr Asp Leu Asp lie Leu Glu Gly Leu 
275 280 285 

Lys Thr Thr His Trp Pro Gly Arg Met Gin Lys Leu Ser Asp Gin Pro 
290 295 300 

Val Val Val Leu Asp Gly Ala His Asn Glu lie Gly Val Lys Ala Leu 
305 310 315 320 

Arg Gin Ser lie Asp His Phe Phe Pro Gly Lys Lys lie Thr Tyr Phe 

325 330 335 



Ala Gly Met Met Val Glu Lys Asp Phe Ala Lys Met Phe Asp Leu Leu 

340 345 350 

Gly Glu Thr Ala Asp Lys Phe Tyr Leu He Ser Pro Asp Leu Thr Arg 
355 *" 360 365 

Gly Phe Asp Val Asp Gin Ala Val Gin Ser Leu Thr Asp Lys Gly Tyr 
370 " 375 380 

Gin Ala Ser Ser Val Ala Ser Leu Gin Ala He Leu Asp Tyr He Asn 
385 390 395 400 

Gin Gin Ala Lys Ala Asp Glu He He lie He Phe Gly Ser Leu Tyr 

405 410 415 



Leu Val Gly Asp Phe Leu Lys Leu Tyr His Glu Ala Ser Gly 

420 425 430 



<210> 25 
<211> 1653 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (91) . . (1653) 

<223> 

<400> 25 

cttcttgttt catttttaat cttatctgaa acaaatgatt tttcaactct tttttatctt 



60 
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actttaaaag ttttagttag gagccttagc ttg tac cgt ate tct atg aaa gac 114 

Met Tyr Arg lie Ser Met L»ys Asp 
1 5 



ttg cat gec eta tta get age aag cag cag ttg aaa gaa gtg gtc ggt 
Leu His Ala Leu Leu Ala Ser Lys Gin Gin Leu Lys Glu Val Val Gly 
10 15 20 

ccc gac caa gtt tgg cat tac aat ttg cct caa ggg gaa ttg gee gac 
Pro Asp Gin Val Trp His Tyr Asn Leu Pro Gin Gly Glu Leu Ala Asp 
25 30 35 40 

caa gtt ttt gac aaa ctt tec tac aat tec caa act gec tec tea gac 
Gin Val Phe Asp Lys Leu Ser Tyr Asn Ser Gin Thr Ala Ser Ser Asp 

45 50 55 

ace ctt ttc ttt tgc aag ggt get tec ttt aaa aga gac tac eta gee 
Thr Leu Phe Phe Cys Lys Gly Ala Ser Phe Lys Arg Asp Tyr Leu Ala 

60 65 70 



162 



210 



258 



306 



450 



498 



546 



cag gcg gtt gac cag ggt gtc caa gtc tat att tec gaa aaa ttg tat 354 
Gin Ala Val Asp Gin Gly Val Gin Val Tyr lie Ser Glu Lys Leu Tyr 
75 80 85 

caa ggc ctg gat get tat gee ate att gtc cgt gac ate cgc cag ace 402 
Gin Gly Leu Asp Ala Tyr Ala lie lie Val Arg Asp He Arg Gin Thr 
90 95 100 

atg gee eta gtc get aag get ttt tac cag get cca gat gaa aaa ttg 
Met Ala Leu Val Ala Lys Ala Phe Tyr Gin Ala Pro Asp Glu Lys Leu 
105 HO 115 120 

ace ctg att ggc att ace ggg acc aag ggc aag aca acc aca agt tac 
Thr Leu He Gly He Thr Gly Thr Lys Gly Lys Thr Thr Thr Ser Tyr 

125 130 135 

etc etc aaa tec ate ctg gac cag gac caa gee ggt aag aca get att 
Leu Leu Lys Ser He Leu Asp Gin Asp Gin Ala Gly Lys Thr Ala He 

140 145 150 

att tea acc ttg ggg att tec tta gac ggc cag acc caa gaa gaa gee 
He Ser Thr Leu Gly He Ser Leu Asp Gly Gin Thr Gin Glu Glu Ala 
155 160 165 

tec ctg acc act cct gaa gee ttg gac etc tac cag atg att gee egg 
Ser Leu Thr Thr Pro Glu Ala Leu Asp Leu Tyr Gin Met He Ala Arg 
170 175 180 

gec caa gac cag ggg atg gac caa ttg att atg gaa gta tct age caa 
Ala Gin Asp Gin Gly Met Asp Gin Leu He Met Glu Val Ser Ser Gin 
185 190 195 200 

gee tac aag atg gac egg gtc tat gga ctg act ttc gac ttt gga gee 
Ala Tyr Lys Met Asp Arg Val Tyr Gly Leu Thr Phe Asp Phe Gly Ala 

205 210 215 



594 



642 



690 



738 
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acc gaa gat gac ccc aat ttt gaa gac gtt caa get ate tgc caa gaa 
Thr Glu Asp Asp Pro Asn Phe Glu Asp Val Gin Ala lie Cys Gin Glu 
425 430 435 4 ^0 



834 



882 



930 



978 



1026 



1074 



ttc tta aat att teg cct gac cat ate ggc cct aat gag cac cca gat 786 
Phe Leu Asn He Ser Pro Asp His He Gly Pro Asn Glu His Pro Asp 

220 225 230 

atg gaa gat tac ttc tat tgt aaa agt cgt ttg gtt aaa cat tec aag 
Met Glu Asp Tyr Phe Tyr Cys Lys Ser Arg Leu Val Lys His Ser Lys 
235 240 245 

ttg gee ttg etc aac get gga ctt gac cag eta gac tac tta aaa gac 
Leu Ala Leu Leu Asn Ala Gly Leu Asp Gin Leu Asp Tyr Leu Lys Asp 
250 255 260 

ctt age caa aaa aat ggc ggt cag gtc caa gtt tac ggc caa gat ccc 
Leu Ser Gin Lys Asn Gly Gly Gin Val Gin Val Tyr Gly Gin Asp Pro 
265 270 275 280 

aag act tgt gac tac tat ttt gag gtt aac aac cag gac age cgc cgc 
Lvs Thr Cys Asp Tyr Tyr Phe Glu Val Asn Asn Gin Asp Ser Arg Arg 

285 290 295 

ttt gee att aaa age caa age cct gat gac ttg gee att gat ggg gat 
Phe Ala He Lys Ser Gin Ser Pro Asp Asp Leu Ala He Asp Gly Asp 

300 305 310 

tac caa ttt gaa atg ttg ggt gat ttt aac aag gag aat gec ctt tgt 
Tyr Gin Phe Glu Met Leu Gly Asp Phe Asn Lys Glu Asn Ala Leu Cys 
315 320 325 

gee get ctt ata gcg ggg cat tta gaa gtt ggg caa gag gee att tac 
Ala Ala Leu He Ala Gly His Leu Glu Val Gly Gin Glu Ala He Tyr 
330 335 340 

caa gga ata gee cag gee caa gtg cca gga egg atg cag cat tat act 
Gin Gly He Ala Gin Ala Gin Val Pro Gly Arg Met Gin His Tyr Thr 
345 350 355 360 

tat ggc aac aat cac ate tat gta gac ttt gee cac aat tac ate age 
Tyr Gly Asn Asn His He Tyr Val Asp Phe Ala His Asn Tyr He Ser 

365 370 375 

ttg aaa aat ctt ttt gat ttt gee caa gac caa cac ccg gac cac acc 
Leu Lys Asn Leu Phe Asp Phe Ala Gin Asp Gin His Pro Asp His Thr 

380 385 390 

atg gtg gtt gtc ttg ggg gee cct ggc aac aag ggg gtg tct cgc cgc 
Met Val Val Val Leu Gly Ala Pro Gly Asn Lys Gly Val Ser Arg Arg 
395 400 405 

aag gat atg gga tac ttg ctg tec caa tac caa ggg gaa gtt ate ttg 1362 
Lys Asp Met Gly Tyr Leu Leu Ser Gin Tyr Gin Gly Glu Val He Leu 
410 415 420 



1122 



1170 



1218 



1266 



1314 



1410 
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att gcc caa tac att gat ggc ccc ate cag gtg acc ttt aat gat aac 

He Ala Gin Tyr He Asp Gly Pro He Gin Val Thr Phe Asn Asp Asn 

445 450 455 

egg ata aat gcc ate caa gac etc eta gag tec tta acc cca gaa agt 

Arg He Asn Ala He Gin Asp Leu Leu Glu Ser Leu Thr Pro Glu Ser 

460 465 470 

caa aaa gtc ate ctg ctt gca ggc aag ggg tec gac cag tac atg ctg 

Gin Lys Val He Leu Leu Ala Gly Lys Gly Ser Asp Gin Tyr Met Leu 

475 480 485 

egg egg ggt gtg aag gaa gat tat gcg gga gac cac aaa ttg gtt gaa 
Arg Arg Gly Val Lys Glu Asp Tyr Ala Gly Asp His Lys Leu Val Glu 

490 495 500 

gca ttt tta aac cag caa aag act tct tct cat gag aag ctt gag ggt 
Ala Phe Leu Asn Gin Gin Lys Thr Ser Ser His Glu Lys Leu Glu Gly 

505 510 515 520 

tag 



1458 



1506 



1554 



1602 



1650 



1653 



<210> 26 
<211> 520 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 26 

Met Tyr Arg He Ser Met Lys Asp Leu His Ala Leu Leu Ala Ser Lys 
1 5 10 15 



Gin Gin Leu Lys Glu Val Val Gly Pro Asp Gin Val Trp His Tyr Asn 

20 25 30 



Leu Pro Gin Gly Glu Leu Ala Asp Gin Val Phe Asp Lys Leu Ser Tyr 
35 40 45 

Asn Ser Gin Thr Ala Ser Ser Asp Thr Leu Phe Phe Cys Lys Gly Ala 
50 55 60 

Ser Phe Lys Arg Asp Tyr Leu Ala Gin Ala Val Asp Gin Gly Val Gin 
65 70 75 80 



Val Tyr He Ser Glu Lys Leu Tyr Gin Gly Leu Asp Ala Tyr Ala He 

85 90 95 



He Val Arg Asp He Arg Gin Thr Met Ala Leu Val Ala Lys Ala Phe 

100 105 110 
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Tyr Gin Ala Pro Asp Glu Lys Leu Thr Leu He Gly He Thr Gly Thr 
115 120 125 



Lys Gly Lys Thr Thr Thr Ser Tyr Leu Leu Lys Ser He Leu Asp Gin 
130 135 140 



Asp Gin Ala Gly Lys Thr Ala He He Ser Thr Leu Gly He Ser Leu 
145 150 155 160 



Asp Gly Gin Thr Gin Glu Glu Ala Ser Leu Thr Thr Pro Glu Ala Leu 

165 170 175 



Asp Leu Tyr Gin Met He Ala Arg Ala Gin Asp Gin Gly Met Asp Gin 

180 185 190 



Leu He Met Glu Val Ser Ser Gin Ala Tyr Lys Met Asp Arg Val Tyr 
195 200 205 



Gly Leu Thr Phe Asp Phe Gly Ala Phe Leu Asn He Ser Pro Asp His 
210 215 220 



He Gly Pro Asn Glu His Pro Asp Met Glu Asp Tyr Phe Tyr Cys Lys 
225 230 235 240 



Ser Arg Leu Val Lys His Ser Lys Leu Ala Leu Leu Asn Ala Gly Leu 

245 250 255 



Asp Gin Leu Asp Tyr Leu Lys Asp Leu Ser Gin Lys Asn Gly Gly Gin 

260 265 270 



Val Gin Val Tyr Gly Gin Asp Pro Lys Thr Cys Asp Tyr Tyr Phe Glu 
275 280 285 



Val Asn Asn Gin Asp Ser Arg Arg Phe Ala He Lys Ser Gin Ser Pro 
290 295 300 



Asp Asp Leu Ala He Asp Gly Asp Tyr Gin Phe Glu Met Leu Gly Asp 
305 310 315 320 



Phe Asn Lys Glu Asn Ala Leu Cys Ala Ala Leu He Ala Gly His Leu 

325 330 335 



Glu Val Gly Gin Glu Ala He Tyr Gin Gly He Ala Gin Ala Gin Val 
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340 345 350 



Pro Gly Arg Met Gin His Tyr Thr Tyr Gly Asn Asn His lie Tyr Val 
355 360 365 



Asp Phe Ala His Asn Tyr lie Ser Leu Lys Asn Leu Phe Asp Phe Ala 
370 375 380 



Gin Asp Gin His Pro Asp His Thr Met Val Val Val Leu Gly Ala Pro 
385 390 395 400 



Gly Asn Lys Gly Val Ser Arg Arg Lys Asp Met Gly Tyr Leu Leu Ser 

405 410 415 



Gin Tyr Gin Gly Glu Val He Leu Thr Glu Asp Asp Pro Asn Phe Glu 

420 425 430 



Asp Val Gin Ala He Cys Gin Glu He Ala Gin Tyr lie Asp Gly Pro 
435 440 445 



He Gin Val Thr Phe Asn Asp Asn Arg He Asn Ala He Gin Asp Leu 
450 455 460 



Leu Glu Ser Leu Thr Pro Glu Ser Gin Lys Val He Leu Leu Ala Gly 
465 470 475 480 



Lys Gly Ser Asp Gin Tyr Met Leu Arg Arg Gly Val Lys Glu Asp Tyr 

485 490 495 



Ala Gly Asp His Lys Leu Val Glu Ala Phe Leu Asn Gin Gin Lys Thr 

500 505 510 



Ser Ser His Glu Lys Leu Glu Gly 
515 520 



<210> 27 
<211> 636 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (636) 

<223> 
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<400> 27 

aggactgaaa ggaaggttgg agtc atg aag tgg ttg agt egg ate ttg att 51 

Met Lys Trp I/eu Ser Arg He Leu He 
1 5 



gtt gtt ggg ata ggc ttt ctg att gec ttt ggc tac acg att tat gac 
Val Val Gly He Gly Phe Leu He Ala Phe Gly Tyr Thr He Tyr Asp 
10 15 20 25 

cat get aac teg aca teg gtt ace eta gaa gaa gee cag gtg gee ctg 
His Ala Asn Ser Thr Ser Val Thr Leu Glu Glu Ala Gin Val Ala Leu 

30 35 40 



ggt cac gtt gag aat aca gtc ttc cct ggc caa ggc gaa caa att gtc 
Gly His Val Glu Asn Thr Val Phe Pro Gly Gin Gly Glu Gin He Val 

110 115 120 

etc tct ggc cac egg gat ace gtc ttc egg gac ttt ggc gaa tta gaa 
Leu Ser Gly His Arg Asp Thr Val Phe Arg Asp Phe Gly Glu Leu Glu 

125 130 135 

att ggc gac aat ttt ate gtt caa atg cct tac ggg gac tat gaa tat 
He Gly Asp Asn Phe He Val Gin Met Pro Tyr Gly Asp Tyr Glu Tyr 
140 145 150 

gag att cag gac tat gaa att gtc gac egg gat gat acc tec gtc ate 
Glu He Gin Asp Tyr Glu He Val Asp Arg Asp Asp Thr Ser Val He 
155 160 165 

egg cct atg ggg gaa gaa gtc tta gtg gtt tea acc tgc tac ccc ttt 
Arg Pro Met Gly Glu Glu Val Leu Val Val Ser Thr Cys Tyr Pro Phe 
170 175 180 185 

gaa ttt tac ggt ttt gee cct gac cgc ttt gtt ttc tat tgt tac ccc 
Glu Phe Tyr Gly Phe Ala Pro Asp Arg Phe Val Phe Tyr Cys Tyr Pro 

190 195 200 

gtt gaa taa 
Val Glu 



99 



147 



gaa gaa age egg gee cag get get gaa get ggg gac ggg gac cag gat 195 
Glu Glu Ser Arg Ala Gin Ala Ala Glu Ala Gly Asp Gly Asp Gin Asp 

45 50 55 

ggc caa gat ggg gcg agt gac ate gat ate caa aac tac cag cct gaa 
Gly Gin Asp Gly Ala Ser Asp He Asp He Gin Asn Tyr Gin Pro Glu 
60 65 70 



243 



get ggg gag get ttt ggg gtc tta gat att ccc aaa etc gac egg age 291 
Ala Gly Glu Ala Phe Gly Val Leu Asp He Pro Lys Leu Asp Arg Ser 
75 80 85 

att ggc att gta gee gga acg gat gca gac tct ctt aag aag ggg gta 339 
He Gly He Val Ala Gly Thr Asp Ala Asp Ser Leu Lys Lys Gly Val 
90 95 100 105 



387 



435 



483 



531 



579 



627 



636 
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<210> 28 
<211> 203 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 28 

Met Lys Trp Leu Ser Arg He Leu He Val Val Gly lie Gly Phe Leu 
1 5 10 15 

He Ala Phe Gly Tyr Thr He Tyr Asp His Ala Asn Ser Thr Ser Val 

20 . 25 30 

Thr Leu Glu Glu Ala Gin Val Ala Leu Glu Glu Ser Arg Ala Gin Ala 
35 40 45 

Ala Glu Ala Gly Asp Gly Asp Gin Asp Gly Gin Asp Gly Ala Ser Asp 
50 55 60 

He Asp He Gin Asn Tyr Gin Pro Glu Ala Gly Glu Ala Phe Gly Val 
65 70 75 80 

Leu Asp He Pro Lys Leu Asp Arg Ser He Gly He Val Ala Gly Thr 

85 90 95 

Asp Ala Asp Ser Leu Lys Lys Gly Val Gly His Val Glu Asn Thr Val 

100 105 110 

Phe Pro Gly Gin Gly Glu Gin He Val Leu Ser Gly His Arg Asp Thr 
115 120 125 

Val Phe Arg Asp Phe Gly Glu Leu Glu He Gly Asp Asn Phe He Val 
130 " 135 140 

Gin Met Pro Tyr Gly Asp Tyr Glu Tyr Glu He Gin Asp Tyr Glu He 
145 150 155 160 

Val Asp Arg Asp Asp Thr Ser Val He Arg Pro Met Gly Glu Glu Val 

165 170 175 



Leu Val Val Ser Thr Cys Tyr Pro Phe Glu Phe Tyr Gly Phe Ala Pro 

180 185 190 



Asp Arg Phe Val Phe Tyr Cys Tyr Pro Val Glu 
195 200 
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<210> 29 
<211> 1290 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (1) . • (1290) 

<223> 



<400> 29 

atg cag tat gca gaa ctt ctt gac etc ctg ccc eta caa gaa caa ggg 
Met Gin Tyr Ala Glu Leu Leu Asp Leu Leu Pro Leu Gin Glu Gin Gly 
1 "* 5 10 15 

aag atg gat ttg ggg eta gca acc atg ace cag gtg atg gac tta ttg 
Lys Met Asp Leu Gly Leu Ala Thr Met Thr Gin Val Met Asp Leu Leu 

20 " 25 30 

ggc aag ccc caa gac cag gtc ccc atg gtt cat ate get ggc acc aat 
Gly Lys Pro Gin Asp Gin Val Pro Met Val His He Ala Gly Thr Asn 
35 40 45 

ggc aag ggg teg gec gca gee ttt aca gag cga ata etc agg gag get 
Gly Lys Gly Ser Ala Ala Ala Phe Thr Glu Arg lie Leu Arg Glu Ala 
50 55 60 

ggc tac aag gtc ggc ttg tat att tec cct tec eta gtg gaa ttt aat 
Gly Tyr Lys Val Gly Leu Tyr He Ser Pro Ser Leu Val Glu Phe Asn 
65 70 75 80 

gaa egg ate caa ate aat ggc caa gee aca agt gat gat cag ttg etc 
Glu Arg He Gin He Asn Gly Gin Ala Thr Ser Asp Asp Gin Leu Leu 

85 90 95 

aag gca gtc aag acc eta age cag gec tta gaa ggc aca tec ctt tgc 
Lys Ala Val Lys Thr Leu Ser Gin Ala Leu Glu Gly Thr Ser Leu Cys 

100 105 HO 

ctg act gaa ttt gaa ctt ttt act gee ctg gee ttt ttg acc ttc cag 
Leu Thr Glu Phe Glu Leu Phe Thr Ala Leu Ala Phe Leu Thr Phe Gin 
115 120 125 



tta gat get acc aat gtg ata age cgt cct gee gtc acc gec att acc 
Leu Asp Ala Thr Asn Val He Ser Arg Pro Ala Val Thr Ala He Thr 
145 ~ 150 155 160 

aag att ggc atg gac cat acc get ttt tta ggg gat age ctg cca gaa 
Lys He Gly Met Asp His Thr Ala Phe Leu Gly Asp Ser Leu Pro Glu 

165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



gac cag get tgt gat ata gec gtt gta gag gtc gga tta gga gga egg 432 
Asp Gin Ala Cys Asp He Ala Val Val Glu Val Gly Leu Gly Gly Arg 
130 135 140 



480 



528 



ata gec ggt gag aag gca gee ate gec aaa gec ggc teg cct atg gtg 



576 
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He Ala Gly Glu Lys Ala Ala lie Ala Lys Ala Gly Ser Pro Met Val 

180 185 190 

gtc tat ccc cag ggg cca gaa gtg act egg gtg ate caa aat cag gcg 
Val Tyr Pro Gin Gly Pro Glu Val Thr Arg Val He Gin Asn Gin Ala 
195 200 205 

gac egg gta gga gee tct ctg ace eta att tct caa tec gac ctg act 
Asp Arg Val Gly Ala Ser Leu Thr Leu He Ser Gin Ser Asp Leu Thr 
210 215 220 

tat aac ctg act teg gac etc ttg caa gac ttt gaa tac aag cag gtt 
Tyr Asn Leu Thr Ser Asp Leu Leu Gin Asp Phe Glu Tyr Lys Gin Val 
225 230 235 240 

ccc tac cgc att tea ctt tta gaa gat tat caa att tac aac gee ctg 
Pro Tyr Arg He Ser Leu Leu Glu Asp Tyr Gin He Tyr Asn Ala Leu 

245 250 255 

gta gca etc gaa ate tct ttt gee tta cag gat get ggc tgg cag att 
Val Ala Leu Glu He Ser Phe Ala Leu Gin Asp Ala Gly Trp Gin He 

260 265 270 

age cct aaa gee att aaa caa ggt ttg gtt gag ace cgc tgg ccc ggc 
Ser Pro Lys Ala He Lys Gin Gly Leu Val Glu Thr Arg Trp Pro Gly 
275 280 285 

cgt ttt gaa ctt ate gee tct cat ccg ace gtg ate gtt gat ggg tct 
Arg Phe Glu Leu He Ala Ser His Pro Thr Val He Val Asp Gly Ser 
290 295 300 

cat aat gaa gac ggc ctg cag get etc ttg get aac eta gac cgc tac 
His Asn Glu Asp Gly Leu Gin Ala Leu Leu Ala Asn Leu Asp Arg Tyr 
305 310 315 320 

ttt cca gaa caa aaaagg att ggg ate gta ggc atg ttg gee gac aag 
Phe Pro Glu Gin Lys Arg He Gly He Val Gly Met Leu Ala Asp Lys 

325 330 335 

gat gtt gat gee gee eta get cct tta ace aaa age ttt gac egg ctt 
Asp Val Asp Ala Ala Leu Ala Pro Leu Thr Lys Ser Phe Asp Arg Leu 

340 345 350 

tat acg gtg aca ccc gat teg ccg egg ggg atg gca gee cct caa atg 
Tyr Thr Val Thr Pro Asp Ser Pro Arg Gly Met Ala Ala Pro Gin Met 
355 360 365 

r 

aaa gaa aaa ctg ace gaa atg gtg teg ccg tct act egg gtc ata get 

Lys Glu Lys Leu Thr Glu Met Val Ser Pro Ser Thr Arg Val He Ala 
370 ~ 375 380 

tgt gaa agt tat aac cag gee tta gac ctg gca ggt caa gta gee ggc 
Cys Glu Ser Tyr Asn Gin Ala Leu Asp Leu Ala Gly Gin Val Ala Gly 
385 * 390 395 400 

gga gat gac eta att gtc gtt ttt gga agt ttt tat att gtt ggt aag 
Gly Asp Asp Leu lie Val Val Phe Gly Ser Phe Tyr lie Val Gly Lys 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 
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405 410 415 

ttt aga cag ctg att tta gca aga aga aat ggg gaa gtt taa 1290 

Phe Arg Gin Leu He Leu Ala Arg Arg Asn Gly Glu Val 

420 425 



<210> 30 
<211> 429 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 30 

Met Gin Tyr Ala Glu Leu Leu Asp Leu Leu Pro Leu Gin Glu Gin Gly 
15 10 15 



Lys Met Asp Leu Gly Leu Ala Thr Met Thr Gin Val Met Asp Leu Leu 

20 25 30 



Gly Lys Pro Gin Asp Gin Val Pro Met Val His He Ala Gly Thr Asn 
35 40 45 



Gly Lys Gly Ser Ala Ala Ala Phe Thr Glu Arg He Leu Arg Glu Ala 
50 55 60 



Gly Tyr Lys Val Gly Leu Tyr He Ser Pro Ser Leu Val Glu Phe Asn 
65 70 75 80 



Glu Arg He Gin He Asn Gly Gin Ala Thr Ser Asp Asp Gin Leu Leu 

85 90 95 



Lys Ala Val Lys Thr Leu Ser Gin Ala Leu Glu Gly Thr Ser Leu Cys 

100 105 110 



Leu Thr Glu Phe Glu Leu Phe Thr Ala Leu Ala Phe Leu Thr Phe Gin 
115 120 125 



Asp Gin Ala Cys Asp He Ala Val Val Glu Val Gly Leu Gly Gly Arg 
130 135 140 



Leu Asp Ala Thr Asn Val He Ser Arg Pro Ala Val Thr Ala He Thr 
145 " 150 155 160 



Lys He Gly Met Asp His Thr Ala Phe Leu Gly Asp Ser Leu Pro Glu 

165 170 175 
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He Ala Gly Glu Lys Ala Ala He Ala Lys Ala Gly Ser Pro Met Val 

180 185 190 



Val Tyr Pro Gin Gly Pro Glu Val Thr Arg Val He Gin Asn Gin Ala 
195 200 205 



Asp Arg Val Gly Ala Ser Leu Thr Leu He Ser Gin Ser Asp Leu Thr 
210 215 220 



Tyr Asn Leu Thr Ser Asp Leu Leu Gin Asp Phe Glu Tyr Lys Gin Val 
225 230 235 240 



Pro Tyr Arg He Ser Leu Leu Glu Asp Tyr Gin He Tyr Asn Ala Leu 

245 250 255 



Val Ala Leu Glu He Ser Phe Ala Leu Gin Asp Ala Gly Trp Gin He 

260 265 270 



Ser Pro Lys Ala He Lys Gin Gly Leu Val Glu Thr Arg Trp Pro Gly 
275 280 285 



Arg Phe Glu Leu He Ala Ser His Pro Thr Val He Val Asp Gly Ser 
290 295 300 



His Asn Glu Asp Gly Leu Gin Ala Leu Leu Ala Asn Leu Asp Arg Tyr 
305 310 315 320 



Phe Pro Glu Gin Lys Arg He Gly He Val Gly Met Leu Ala Asp Lys 

325 330 335 



Asp Val Asp Ala Ala Leu Ala Pro Leu Thr Lys Ser Phe Asp Arg Leu 

340 345 350 



Tyr Thr Val Thr Pro Asp Ser Pro Arg Gly Met Ala Ala Pro Gin Met 
355 360 365 



Lys Glu Lys Leu Thr Glu Met Val Ser Pro Ser Thr Arg Val He Ala 
370 375 380 



Cys Glu Ser Tyr Asn Gin Ala Leu Asp Leu Ala Gly Gin Val Ala Gly 
385 " 390 395 400 



Gly Asp Asp Leu He Val Val Phe Gly Ser Phe Tyr He Val Gly Lys 



7S 
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405 410 415 



Phe Arg Gin Leu Xle Leu Ala Arg Arg Asn Gly Glu Val 

420 425 



<210> 31 
<211> 387 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (387) 

<223> 

<400> 31 

agaaagagga ataag atg gat aaa aga gat aag ata cgc ttg caa ggg atg 51 

Met Asp Lys Arg Asp Lys lie Arg Leu Gin Gly Met 
15 10 

act ttt cac ggc cac cac ggt ttg atg gag gcc gaa acc aag ttg ggt 99 
Thr Phe His Gly His His Gly Leu Met Glu Ala Glu Thr Lys Leu Gly 
15 20 25 

cag att ttt aaa gtc gac ctt gtc tta gta act gac etc aag tta gcg 147 
Gin lie Phe Lys Val Asp Leu Val Leu Val Thr Asp Leu Lys Leu Ala 
30 35 40 

ggt caa aca gac aag atg ggg cac agt ate cac tac ggg gaa gtt tat 195 
Gly Gin Thr Asp Lys Met Gly His Ser He His Tyr Gly Glu Val Tyr 
45 50 55 60 



gac ctg gtc aag tec att gtg gaa ggt acc ccc ttt aag ctt ttg gag 
Asp Leu Val Lys Ser He Val Glu Gly Thr Pro Phe Lys Leu Leu Glu 

65 70 75 



243 



tec ttg gcg gaa acc eta gcc caa gaa gtt etc aag act ttt gac cag 291 

Seir Leu Ala Glu Thr Leu Ala Gin Glu Val Leu Lys Thr Phe Asp Gin 

80 85 90 

gtt gag gag gtc ttg gtc egg gtc aac aaa ccc cag gcc ccg att cct 339 

Val Glu Glu Val Leu Val Arg Val Asn Lys Pro Gin Ala Pro He Pro 
95 100 105 



ggt gtc ttt gac aat gta gcg gtg gaa ate acc egg gcc cgt cac tag 
Gly Val Phe Asp Asn Val Ala Val Glu He Thr Arg Ala Arg His 
110 115 120 



<210> 32 
<211> 123 
<212> PRT 

<213> Alloiococcus otitidis 



387 



<400> 32 
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Met Asp Lys Arg Asp Lys He Arg Leu Gin Gly Met Thr Phe His Gly 
15 10 15 



His His Gly Leu Met Glu Ala Glu Thr Lys Leu Gly Gin lie Phe Lys 

20 25 30 



Val Asp Leu Val Leu Val Thr Asp Leu Lys Leu Ala Gly Gin Thr Asp 
35 40 45 



Lys Met Gly His Ser He His Tyr Gly Glu Val Tyr Asp Leu Val Lys 
50 55 60 



Ser He Val Glu Gly Thr Pro Phe Lys Leu Leu Glu Ser Leu Ala Glu 
65 70 75 80 



Thr Leu Ala Gin Glu Val Leu Lys Thr Phe Asp Gin Val Glu Glu Val 

85 90 95 



Leu Val Arg Val Asn Lys Pro Gin Ala Pro He Pro Gly Val Phe Asp 

100 * 105 110 



Asn Val Ala Val Glu He Thr Arg Ala Arg His 
115 120 



<210> 33 
<211> 552 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (552) 

<223> 

<400> 33 

ataggtaagg aggaatatag a gtg aag ggt gtt atg ata gga etc ggt tct 51 

Met Lys Gly Val Met He Gly Leu Gly Ser 
15 10 

aat atg ggg act aag ttg get tac tta aac egg get ttg gee aaa ata 99 
Asn Met Gly Thr Lys Leu Ala Tyr Leu Asn Arg Ala Leu Ala Lys lie 

15 20 25 

aat age eta gac cag gta gca gtc aag caa gtt tea aag gtt tac cag 147 
Asn Ser Leu Asp Gin Val Ala Val Lys Gin Val Ser Lys Val Tyr Gin 

30 35 40 



act gaa ccg gtg ggc tac aag gac cag gac gat ttt tac aat atg gtt 195 
Thr Glu Pro Val Gly Tyr Lys Asp Gin Asp Asp Phe Tyr Asn Met Val 
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45 50 55 

get ggc ctt gaa att gaa cca ggc aag acc ccc ttg gac etc tta gaa 
Ala Gly Leu Glu lie Glu Pro Gly Lys Thr Pro Leu Asp Leu Leu Glu 
60 65 70 

gac ttg ctg gcg att gag gca gac ctg gac agg aag egg acc att aaa 
Asp Leu Leu Ala He Glu Ala Asp Leu Asp Arg Lys Arg Thr He Lys 
75 80 85 90 

aat ggc ccc cga acc att gac ttg gat gtc ttg ctg gtg gag ggt caa 
Asn Gly Pro Arg Thr lie Asp Leu Asp Val Leu Leu Val Glu Gly Gin 

95 100 105 

gaa att gac cat ccc aag etc caa gtt ccc cac cca agg etc cag gac 
Glu He Asp His Pro Lys Leu Gin Val Pro His Pro Arg Leu Gin Asp 

HO 115 120 

egg gee ttt gtc ttg gtc ccc ttg get gag ttg gac ccc aac tac ctg 
Arg Ala Phe Val Leu Val Pro Leu Ala Glu Leu Asp Pro Asn Tyr Leu 
125 130 135 

gtt cct ggc ata gat aag aca gtt gcg gac ttg ttg get tct tta aac 
Val Pro Gly He Asp Lys Thr Val Ala Asp Leu Leu Ala Ser Leu Asn 
140 145 150 



tta gaa gac cgt gag get tga 
Leu Glu Asp Arg Glu Ala 

175 



243 



291 



339 



387 



435 



483 



caa acc gac eta gca ggg gtg gag get ttg ggt cag ttg acg aac eta 531 
Gin Thr Asp Leu Ala Gly Val Glu Ala Leu Gly Gin Leu Thr Asn Leu 
155 • 160 165 170 



552 



<210> 34 
<211> 176 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 34 

Met Lys Gly Val Met He Gly Leu Gly Ser Asn Met Gly Thr Lys Leu 
i5 10 15 



Ala Tyr Leu Asn Arg Ala Leu Ala Lys He Asn Ser Leu Asp Gin Val 

20 25 30 

Ala Val Lys Gin Val Ser Lys Val Tyr Gin Thr Glu Pro Val Gly Tyr 
35 40 45 



Lys Asp Gin Asp Asp Phe Tyr Asn Met Val Ala Gly Leu Glu He Glu 
50 55 60 
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Pro Gly Lys Thr Pro Leu Asp Leu Leu Glu Asp Leu Leu Ala He Glu 
65 70 75 80 



Ala Asp Leu Asp Arg Lys Arg Thr He Lys Asn Gly Pro Arg Thr He 

85 90 95 



Asp Leu Asp Val Leu Leu Val Glu Gly Gin Glu He Asp His Pro Lys 

100 105 110 



Leu Gin Val Pro His Pro Arg Leu Gin Asp Arg Ala Phe Val Leu Val 
115 120 125 



Pro Leu Ala Glu Leu Asp Pro Asn Tyr Leu Val Pro Gly He Asp Lys 
130 135 140 



Thr Val Ala Asp Leu Leu Ala Ser Leu Asn Gin Thr Asp Leu Ala Gly 
145 150 155 160 



Val Glu Ala Leu Gly Gin Leu Thr Asn Leu Leu Glu Asp Arg Glu Ala 

165 170 175 



<210> 35 
<211> 1242 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (40) . . (1242) 

<223> 

<400> 35 

aatcttctta atatcgcttg gcccaagacc gctataata gtg gta agt gat tat 54 

Met Val Ser Asp Tyr 
1 5 

ttt agg agg ttc aat atg caa ata gga att gac aag ctg get ttt gcg 102 
Phe Arg Arg Phe Asn Met Gin He. Gly He Asp Lys Leu Ala Phe Ala 

10 15 20 

act cca acc agg tac ttg gaa atg gcg agt ctg gec caa gec egg tec ' 150 
Thr Pro Thr Arg Tyr Leu Glu Met Ala Ser Leu Ala Gin Ala Arg Ser 

25 30 35 

caa gac cct aat aaa tat ate aag ggg eta ggc caa gaa gec atg get 198 
Gin Asp Pro Asn Lys Tyr He Lys Gly Leu Gly Gin Glu Ala Met Ala 
40 45 50 
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gtc cct gaa gaa agt gat gat gcc gtc age ttg gcg get aat gec ggt 246 

Val Pro Glu Glu Ser Asp Asp Ala Val Ser Leu Ala Ala Asn Ala Gly 
55 60 65 

aat tta ate tta agt gaa gaa gac aag get get att gac atg gtg ata 294 

Asn Leu lie Leu Ser Glu Glu Asp Lys Ala Ala lie Asp Met Val He 

70 75 80 85 

gtc ggt ace gaa tct ggg gtc gac cag tec aag teg gca gcc age tgg 342 

Val Gly Thr Glu Ser Gly Val Asp Gin Ser Lys Ser Ala Ala Ser Trp 

90 95 100 

gtt cat gac ctg ttg ggg ate aac ccc cat get aga age ctg gag ate 3 90 

Val His Asp Leu Leu Gly He Asn Pro His Ala Arg Ser Leu Glu He 

105 110 115 

aag caa gcc tgc tac ggg get acg get gga etc aaa eta get gtg gcc 438 

Lys Gin Ala Cys Tyr Gly Ala Thr Ala Gly Leu Lys Leu Ala Val Ala 
120 ' 125 130 

cac eta gcc tta aac cct gac tec aag gtt tta gtc ate ggt tea gac 486 

His Leu Ala Leu Asn Pro Asp Ser Lys Val Leu Val He Gly Ser Asp 
135 140 145 

ata gcc aag tat ggt ttg gaa aca ggg ggc gag ccc act caa gga get 534 

He Ala Lys Tyr Gly Leu Glu Thr Gly Gly Glu Pro Thr Gin Gly Ala 

150 155 160 165 

• 

ggg gcg gtc gcc ate tta gtc age cgt gac cct gca att get gtg gtc 582 

Gly Ala Val Ala He Leu Val Ser Arg Asp Pro Ala He Ala Val Val 

170 175 180 

aac aat gac agt gcc atg ctg ace aaa aat att gca gac ttt tgg cga 63 0 

Asn Asn Asp Ser Ala Met Leu Thr Lys Asn He Ala Asp Phe Trp Arg 

185 190 195 

ccc aac tac age gat tat gcc cat gta gat ggc aag ttc tec aac cag 678 

Pro Asn Tyr Ser Asp Tyr Ala His Val Asp Gly Lys Phe Ser Asn Gin 
200 205 210 

gca tac ttg tec aac eta gca gaa gtc tgg cgc cag tat aag ate aaa 726 

Ala Tyr Leu Ser Asn Leu Ala Glu Val Trp Arg Gin Tyr Lys He Lys 
215 220 225 

aac cag ctg tct get aag gat ttc aag gcc atg gtc ttc cac age ccc 774 

Asn Gin Leu Ser Ala Lys Asp Phe Lys Ala Met Val Phe His Ser Pro 

230 235 240 245 

tat ace aag atg ggg aaa aag gcc tta etc aaa eta gga gat tat gaa 822 

Tyr Thr Lys Met Gly Lys Lys Ala Leu Leu Lys Leu Gly Asp Tyr Glu 

250 255 260 

gac cag aaa gag att gac cgc ttg ctg gcc tat tac gag cct ggt cgc 870 

Asp Gin Lys Glu He Asp Arg Leu Leu Ala Tyr Tyr Glu Pro Gly Arg 

265 270 275 
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tac tac aat aag egg gtc ggt aat ate tat act ggg tct ctt tac ttg 918 
Tyr Tyr Asn Lys Arg Val Gly Asn lie Tyr Thr Gly Ser Leu Tyr Leu 
280 285 290 

agt ttg att tec etc tta gac cag gta agt gac ctg gag get ggc gac 966 
Ser Leu lie Ser Leu Leu Asp Gin Val Ser Asp Leu Glu Ala Gly Asp 
295 300 305 

egg att ggc etc tat tct tat ggg tct ggt gee gtt gga gag ttc ttt 1014 
Arg lie Gly Leu Tyr Ser Tyr Gly Ser Gly Ala Val Gly Glu Phe Phe 
310 315 320 325 

age att egg etc cag cca ggt tac aag gaa age tta cag caa gtt gac 1062 
Ser lie Arg Leu Gin Pro Gly Tyr Lys Glu Ser Leu Gin Gin Val Asp 

330 335 340 

ttc gac cag gtt gtc aac cag cgt tea gca tta gag atg tac age tat 1110 
Phe Asp Gin Val Val Asn Gin Arg Ser Ala Leu Glu Met Tyr Ser Tyr 

345 350 355 

cag gac ttg ctg ace ttt age eta cct caa gac ggc caa act tac act 1158 
Gin Asp Leu Leu Thr Phe Ser Leu Pro Gin Asp Gly Gin Thr Tyr Thr 
360 365 370 

aca gat aaa agt cac cag gtc cca ggc cgt ttt gtc tta gac egg gtg 1206 
Thr Asp Lys Ser His Gin Val Pro Gly Arg Phe Val Leu Asp Arg Val 
375 380 385 

gee gac cat ate cgt tac tac egg cgc ttg get taa 1242 
Ala Asp His lie Arg Tyr Tyr Arg Arg Leu Ala 
390 395 400 



<210> 36 
<211> 400 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 36 

Met Val . Ser Asp Tyr Phe Arg Arg Phe Asn Met Gin lie Gly lie Asp 
1 ' 5 10 15 



Lys Leu Ala Phe Ala Thr Pro Thr Arg Tyr Leu Glu Met Ala Ser Leu 

20 25 30 



Ala Gin Ala Arg Ser Gin Asp Pro Asn Lys Tyr lie Lys Gly Leu Gly 

35 40 45 

Gin Glu Ala Met Ala Val Pro Glu Glu Ser Asp Asp Ala Val Ser Leu 
50 55 60 



Ala Ala Asn Ala Gly Asn Leu lie Leu Ser Glu Glu Asp Lys Ala Ala 
65 70 75 80 
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lie Asp Met Val He Val Gly Thr Glu Ser Gly Val Asp Gin Ser Lys 

85 90 95 



Ser Ala Ala Ser Trp Val His Asp Leu Leu Gly He Asn Pro His Ala 

100 105 110 



Arg Ser Leu Glu He Lys Gin Ala Cys Tyr Gly Ala Thr Ala Gly Leu 
115 120 125 



Lys Leu Ala Val Ala His Leu Ala Leu Asn Pro Asp Ser Lys Val Leu 
130 135 140 



Val lie Gly Ser Asp He Ala Lys Tyr Gly Leu Glu Thr Gly Gly Glu 
145 150 155 160 



Pro Thr Gin Gly Ala Gly Ala Val Ala He Leu Val Ser Arg Asp Pro 

165 170 175 



Ala He Ala Val Val Asn Asn Asp Ser Ala Met Leu Thr Lys Asn He 

180 185 190 



Ala Asp Phe Trp Arg Pro Asn Tyr Ser Asp Tyr Ala His Val Asp Gly 
195 200 205 



Lys Phe Ser Asn Gin Ala Tyr Leu Ser Asn Leu Ala Glu Val Trp Arg 
210 215 220 



Gin Tyr Lys He Lys Asn Gin Leu Ser Ala Lys Asp Phe Lys Ala Met 
225 230 235 240 



Val Phe His Ser Pro Tyr Thr Lys Met Gly Lys Lys Ala Leu Leu Lys 

245 250 255 



Leu Gly Asp Tyr Glu Asp Gin Lys Glu He Asp Arg Leu Leu Ala Tyr 

260 265 270 



Tyr Glu Pro Gly Arg Tyr Tyr Asn Lys Arg Val Gly Asn lie Tyr Thr 
275 280 285 



Gly Ser Leu Tyr Leu Ser Leu He Ser Leu Leu Asp Gin Val Ser Asp 
290 295 300 
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Leu Glu Ala Gly Asp Arg He Gly Leu Tyr Ser Tyr Gly Ser Gly Ala 
305 310 315 320 

Val Gly Glu Phe Phe Ser He Arg Leu Gin Pro Gly Tyr Lys Glu Ser 

325 330 335 



Leu Gin Gin Val Asp Phe Asp Gin Val Val Asn Gin Arg Ser Ala Leu 

340 345 350 

Glu Met Tyr Ser Tyr Gin Asp Leu Leu Thr Phe Ser Leu Pro Gin Asp 
355 360 365 

Gly Gin Thr Tyr Thr Thr Asp Lys Ser His Gin Val Pro Gly Arg Phe 
370 375 380 

Val Leu Asp Arg Val Ala Asp His lie Arg Tyr Tyr Arg Arg Leu Ala 
385 390 395 400 



<210> 37 
<211> 1323 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (31) . . (1323) 

<223> 

<400> 37 

ttctggtata gattaaggaa ggaggagacc atg tta ccc tta ttc aag caa ttt 

Met Leu Pro Leu Phe Lys Gin Phe 
1 5 

tac aag caa age etc age cag cgc etc aaa get eta gaa aag gcc ggc 
Tyr Lys Gin Ser Leu Ser Gin Arg Leu Lys Ala Leu Glu Lys Ala Gly 
10 15 20 

tat ctt gat cct gac cag gcg ggt aaa etc cag tea ggg gaa ctg ggt 
Tyr Leu Asp Pro Asp Gin Ala Gly Lys Leu Gin Ser Gly Glu Leu Gly 
25 30 35 40 

ttg acc cat gaa gcc ggc gac cac atg att gaa aac tac ate ggc tec 
Leu Thr His Glu Ala Gly Asp His Met lie Glu Asn Tyr He Gly Ser 

45 50 55 

tat acc etc cct ctg gga ctg gcc etc cac ttt tta etc gat ggc aag 
Tyr Thr Leu Pro Leu Gly Leu Ala Leu His Phe Leu Leu Asp Gly Lys 



54 



102 



150 



198 



246 
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60 65 70 

age tac eta gtc ccc atg get att gaa gag ccc tct gtc att gee get 
Ser Tyr Leu Val Pro Met Ala lie Glu Glu Pro Ser Val lie Ala Ala 
75 80 85 



gtc aag gaa aac egg ctg atg ate ggt caa gtg gtc ata gee gga age 

Val Lys Glu Asn Arg Leu Met lie Gly Gin Val Val lie Ala Gly Ser 
105 HO 115 120 

aca aaa cct age cag gac egg gga aaa ate ctg age cac cag caa gac 

Thr Lys Pro Ser Gin Asp Arg Gly Lys He Leu Ser His Gin Gin Asp 

125 130 135 

tta ate gac eta gee aat get age tat ccc tea att ggt aaa aga ggg 
Leu He Asp Leu Ala Asn Ala Ser Tyr Pro Ser He Gly Lys Arg Gly 

140 145 150 



294 



gee age aac ggt gee aag atg gta gee caa age ggt ggt ttc cat aca 342 
Ala Ser Asn Gly Ala Lys Met Val Ala Gin Ser Gly Gly Phe His Thr 
90 95 100 



390 



438 



486 



630 



678 



726 



ggt ggg gee cga ggc att caa gtc aaa cag ttt gac tea gac ctg ggc 534 
Gly Gly Ala Arg Gly He Gin Val Lys Gin Phe Asp Ser Asp Leu Gly 
155 160 165 

cag gat atg gga age tat ctg gca gtc tac ttg act gtt gac tgc cag 582 
Gin Asp Met Gly Ser Tyr Leu Ala Val Tyr Leu Thr Val Asp Cys Gin 
170 175 180 

gaa gee atg ggg get aac att ate aac ace atg ctg gaa gec ctg get 
Glu Ala Met Gly Ala Asn He He Asn Thr Met Leu Glu Ala Leu Ala 
185 190 195 200 

cct gaa att gac cgc eta acc age ggc cag gtc ttg atg tec ate tta 
Pro Glu He Asp Arg Leu Thr Ser Gly Gin Val Leu Met Ser He Leu 

205 210 215 

tct aac ctg gee act gaa tec ctt gtc act gtt tec tgt caa gta aaa 
Ser Asn Leu Ala Thr Glu Ser Leu Val Thr Val Ser Cys Gin Val Lys 

220 225 230 

ccc aga ttt tta gtc aaa aat gac atg gca ggg gaa get gtc egg gac 
Pro Arg Phe Leu Val Lys Asn Asp Met Ala Gly Glu Ala Val Arg Asp 
235 240 245 

caa ate ate cag gee tac cag tat gee tgc ctg gac ccc tac egg gca 
Gin He He Gin Ala Tyr Gin Tyr Ala Cys Leu Asp Pro Tyr Arg Ala 
250 255 260 

gee acc cac aac aag ggg ate atg aac ggg gta gac ggc ttg gtc eta 
Ala Thr His Asn Lys Gly He Met Asn Gly Val Asp Gly Leu Val Leu 
265 270 275 280 

get agt ggg aat gat tgg egg gca ate gaa gcg ggg gee cat get tac 
Ala Ser Gly Asn Asp Trp Arg Ala He Glu Ala Gly Ala His Ala Tyr 

285 290 295 



774 



822 



870 



918 
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get agt ttg acc ggc cac tac cgc ccc ttg tec aag tgg gaa aag ace 
Ala Ser Leu Thr Gly His Tyr Arg Pro Leu Ser Lys Trp Glu Lys Thr 

300 305 310 

caa gac gga cag tta aaa ggg acc att acc ctt ccc ttg cca att gec 
Gin Asp Gly Gin Leu Lys Gly Thr He Thr Leu Pro Leu Pro He Ala 
315 320 325 

aca gtt ggt ggg get att gee tec cac cct gta gec caa gtt age cag 
Thr Val Gly Gly Ala He Ala Ser His Pro Val Ala Gin Val Ser Gin 
330 335 340 

caa ate tta ggc caa cct act get aag caa tta gee egg ctg gtt gca 
Gin He Leu Gly Gin Pro Thr Ala Lys Gin Leu Ala Arg Leu Val Ala 
345 350 355 360 

gca gtg gga eta gee cag aac eta tec get ctt cgt gec tta gtc aca 
Ala Val Gly Leu Ala Gin Asn Leu Ser Ala Leu Arg Ala Leu Val Thr 

365 370 375 

act ggt att caa caa gga cac atg gee etc cag gca agg tct ttg gec 
Thr Gly He Gin Gin Gly His Met Ala Leu Gin Ala Arg Ser Leu Ala 

380 385 390 



966 



aag aac atg gaa gaa gac taa 
Lys Asn Met Glu Glu Asp 
425 430 



1014 



1062 



1110 



1158 



1206 



atg aat gec ggg gee egg gga gac aag ate caa aag ctg gca gac cgc 1254 
Met Asn Ala Gly Ala Arg Gly Asp Lys He Gin Lys Leu Ala Asp Arg 
395 400 405 

tta att aac caa gac caa atg aac eta gca act gee cgt gee ctg etc 1302 
Leu He Asn Gin Asp Gin Met Asn Leu Ala Thr Ala Arg Ala Leu Leu 
410 415 420 



1323 



<210> 38 
<211> 430 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 38 

Met Leu Pro Leu Phe Lys Gin Phe 
1 5 



Leu Lys Ala Leu Glu Lys Ala Gly 

20 



Lys Leu Gin Ser Gly Glu Leu Gly 
35 40 



Tyr Lys Gin Ser Leu Ser Gin Arg 
10 15 

Tyr Leu Asp Pro Asp Gin Ala Gly 
25 30 

Leu Thr His Glu Ala Gly Asp His 

45 



Met He Glu Asn Tyr He Gly Ser Tyr Thr Leu Pro Leu Gly Leu Ala 
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50 55 60 



Leu His Phe Leu Leu Asp Gly Lys Ser Tyr Leu Val Pro Met Ala lie 
65 70 75 80 

Glu Glu Pro Ser Val lie Ala Ala Ala Ser Asn Gly Ala Lys Met Val 

85 90 95 



Ala Gin Ser Gly Gly Phe His Thr Val Lys Glu Asn Arg Leu Met lie 

100 105 110 



Gly Gin Val Val lie Ala Gly Ser Thr Lys Pro Ser Gin Asp Arg Gly 
115 120 125 



Lys He Leu Ser His Gin Gin Asp Leu He Asp Leu Ala Asn Ala Ser 
13 0 135 140 



Tyr Pro Ser He Gly Lys Arg Gly Gly Gly Ala Arg Gly He Gin Val 
145 150 155 160 



Lys Gin Phe Asp Ser Asp Leu Gly Gin Asp Met Gly Ser Tyr Leu Ala 

165 170 175 



Val Tyr Leu Thr Val Asp Cys Gin Glu Ala Met Gly Ala Asn He He 

180 185 190 



Asn Thr Met Leu Glu Ala Leu Ala Pro Glu He Asp Arg Leu Thr Ser 
195 200 205 



Gly Gin Val Leu Met Ser He Leu Ser Asn Leu Ala Thr Glu Ser Leu 
210 215 220 



Val Thr Val Ser Cys Gin Val Lys Pro Arg Phe Leu Val Lys Asn Asp 
225 " 230 235 240 



Met Ala Gly Glu Ala Val Arg Asp Gin He He Gin Ala Tyr Gin Tyr 

245 250 255 



Ala Cys Leu Asp Pro Tyr Arg Ala Ala Thr His Asn Lys Gly He Met 

260 265 270 



Asn Gly Val Asp Gly Leu Val Leu Ala Ser Gly Asn Asp Trp Arg Ala 
275 280 285 



WO 03/104391 



87/235 



PCT/US02/36122 



He Glu Ala Gly Ala His Ala Tyr Ala Ser Leu Thr Gly His Tyr Arg 
290 295 300 



Pro Leu Ser Lys Trp Glu Lys Thr Gin Asp Gly Gin Leu Lys Gly Thr 
305 " 310 315 320 



He Thr Leu Pro Leu Pro He Ala Thr Val Gly Gly Ala He Ala Ser 

325 330 335 



His Pro Val Ala Gin Val Ser Gin Gin He Leu Gly Gin Pro Thr Ala 

340 345 350 



Lys Gin Leu Ala Arg Leu Val Ala Ala Val Gly Leu Ala Gin Asn Leu 
355 360 365 



Ser Ala Leu Arg Ala Leu Val Thr Thr Gly He Gin Gin Gly His Met 
370 375 380 



Ala Leu Gin Ala Arg Ser Leu Ala Met Asn Ala Gly Ala Arg Gly Asp 
385 390 395 400 



Lys He Gin Lys Leu Ala Asp Arg Leu He Asn Gin Asp Gin Met Asn 

405 410 415 



Leu Ala Thr Ala Arg Ala Leu Leu Lys Asn Met Glu Glu Asp 

420 425 430 



<210> 39 
<211> 930 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (930) 

<223> 

<400> 39 

aggattagta aa atg tta ttt gat cgt ate gta gaa gec ttt ccc gaa age 

Met Leu Phe Asp Arg He Val Glu Ala Phe Pro Glu Ser 
15 10 



51 



aac ate aaa aaa gat gaa ccc ttg tec tat tac tct tac act cga aca 99 
Asn He Lys Lys Asp Glu Pro Leu Ser Tyr Tyr Ser Tyr Thr Arg Thr 
15 20 25 
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ggt ggc ccg get gac att ttg att ttc cca gaa tec ate gat gaa att 
Gly Gly Pro Ala Asp lie Leu He Phe Pro Glu Ser He Asp Glu He 
30 35 40 45 

gtg acg att ate aag tgg ate aac caa agt ccg gaa tac caa get ggc 
Val Thr He He Lys Trp He Asn Gin Ser Pro Glu Tyr Gin Ala Gly 

50 55 60 

gat etc ccc etc act ate tta ggc aat get age aac ctg ate gta aaa 
Asp Leu Pro Leu Thr He Leu Gly Asn Ala Ser Asn Leu He Val Lys 

65 70 75 

gat ggt ggg afca aga ggg att acc ate att ace ace ggc att aaa acc 
Asp Gly Gly He Arg Gly He Thr He He Thr Thr Gly He Lys Thr 
80 85 90 

att tgt cac gaa gag aac egg ate act gcg ggc get gga gca get att 
He Cys His Glu Glu Asn Arg He Thr Ala Gly Ala Gly Ala Ala He 
95 100 105 

ate gat gtt age cag get gee ttg gac cat age tta act ggc ttg gaa 
lie Asp Val Ser Gin Ala Ala Leu Asp His Ser Leu Thr Gly Leu Glu 
110 115 120 125 



147 



get ggg get tac ggt ggg gaa gtc cag cat tgt gtt gaa agt gtc caa 
Ala Gly Ala Tyr Gly Gly Glu Val Gin His Cys Val Glu Ser Val Gin 

145 150 155 



aac ttc tec tac cgc cac agt tat ttg atg gaa gaa gac gat ata gta 
Asn Phe Ser Tyr Arg His Ser Tyr Leu Met Glu Glu Asp Asp He Val 
175 180 185 

gtc tec gtg acc ttt aaa ttg gag teg ggc gac tac ate act ate aag 
Val Ser Val Thr Phe Lys Leu Glu Ser Gly Asp Tyr He Thr He Lys 
190 195 200 205 

gaa aag atg gat gaa tta acc tac ctt aga gaa tec aaa caa ccg ctg 
Glu Lys Met Asp Glu Leu Thr Tyr Leu Arg Glu Ser Lys Gin Pro Leu 

210 215 220 



gga gee cag gta tec gaa aaa cat gee ggt ttt ate att aat ata ggc 



195 



243 



291 



339 



387 



ttc get tgt ggc ata ccg ggt agt aca ggc ggg get gtt tac atg aac 435 
Phe Ala Cys Gly He Pro Gly Ser Thr Gly Gly Ala Val Tyr Met Asn 

130 135 140 



483 



gtc ttg acc egg cat ggc cag ttg aag acc tat agt aat gcg gaa atg 531 
Val Leu Thr Arg His Gly Gin Leu Lys Thr Tyr Ser Asn Ala Glu Met 
160 *" 165 170 



579 



627 



675 



gaa tac ccc tct tgt ggg tea gtc ttt aaa aga cct gaa ggc cac ttt 723 
Glu Tyr Pro Ser Cys Gly Ser Val Phe Lys Arg Pro Glu Gly His Phe 

225 230 235 

acg ggg aaa tta ate cag gat get ggc ctt caa gga ttg gtc cat ggt 771 
Thr Gly Lys Leu He Gin Asp Ala Gly Leu Gin Gly lieu Val His Gly 
240 245 250 



819 
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Gly Ala Gin Val Ser Glu Lys His Ala Gly Phe lie lie Asn He Gly 
255 260 265 

aat get acc gee age gac tac caa gag ttg ate caa cat ate caa gaa 
Asn Ala Thr Ala Ser Asp Tyr Gin Glu Leu He Gin His He Gin Glu 
270 275 280 285 



ata ggg gag gat tag 
He Gly Glu Asp 

305 



<210> 40 
<211> 305 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 40 

Met Leu Phe Asp Arg He Val Glu Ala Phe Pro Glu Ser Asn He Lys 
1 5 10 15 

Lys Asp Glu Pro Leu Ser Tyr Tyr Ser Tyr Thr Arg Thr Gly Gly Pro 

20 25 30 



Ala Asp He Leu He Phe Pro Glu Ser He Asp Glu He Val Thr He 
3 5 40 45 



He Lys Trp He Asn Gin Ser Pro Glu Tyr Gin Ala Gly Asp Leu Pro 
50 55 60 



Leu Thr He Leu Gly Asn Ala Ser Asn Leu He Val Lys Asp Gly Gly 
65 70 75 80 



He Arg Gly He Thr He He Thr Thr Gly He Lys Thr He Cys His 

85 90 95 



Glu Glu Asn Arg He Thr Ala Gly Ala Gly Ala Ala He He Asp Val 

100 105 110 



Ser Gin Ala Ala Leu Asp His Ser Leu Thr Gly Leu Glu Phe Ala Cys 
115 120 125 



867 



gaa gtc tac egg att tac aag gtt aag ctg gaa cgt gaa gtt cgc att 915 
Glu Val Tyr Arg He Tyr Lys Val Lys Leu Glu Arg Glu Val Arg He 

290 295 300 



930 



Gly He Pro Gly Ser Thr Gly Gly Ala Val Tyr Met Asn Ala Gly Ala 
130 135 140 
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Tyr Gly Gly Glu Val Gin His Cys Val Glu Ser Val Gin Val Leu Thr 
145 150 155 160 

Arg His Gly Gin Leu Lys Thr Tyr Ser Asn Ala Glu Met Asn Phe Ser 

165 170 175 

Tyr Arg His Ser Tyr Leu Met Glu Glu Asp Asp He Val Val Ser Val 

180 185 190 

Thr Phe Lys Leu Glu Ser Gly Asp Tyr He Thr He Lys Glu Lys Met 
195 200 205 

Asp Glu Leu Thr Tyr Leu Arg Glu Ser Lys Gin Pro Leu Glu Tyr Pro 
210 215 220 

Ser Cys Gly Ser Val Phe Lys Arg Pro Glu Gly His Phe Thr Gly Lys 
225 230 235 240 

Leu He Gin Asp Ala Gly Leu Gin Gly Leu Val His Gly Gly Ala Gin 

245 250 255 

Val Ser Glu Lys His Ala Gly Phe He He Asn He Gly Asn Ala Thr 

260 265 270 

Ala Ser Asp Tyr Gin Glu Leu He Gin His He Gin Glu Glu Val Tyr 
275 280 285 

Arg He Tyr Lys Val Lys Leu Glu Arg Glu Val Arg He He Gly Glu 
290 295 300 



Asp 
305 



<210> 41 
<211> 1104 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1104) 

<223> 

<400> 41 

aaagctggtg ttttc atg gtt tat age tta agg att ccg ggg aaa ctt tat bi 
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Met Val Tyr Ser Leu Arg lie Pro Gly Lys Leu Tyr 
15 10 

ttg gca ggt gaa tac gca gta gta acc ccc ggc tat gcc ggg ate ttg 
Leu Ala Gly Glu Tyr Ala Val Val Thr Pro Gly Tyr Ala Gly lie Leu 
15 20 25 

ctg aca gtc age egg tat ttg act tta gac att tgg gaa aca tct ccc 
Leu Thr Val Ser Arg Tyr Leu Thr Leu Asp He Trp Glu Thr Ser Pro 
30 35 40 

gac caa get tea gtc agg tct caa aca tat ggc aac cag gcc tat get 
Asp Gin Ala Ser Val Arg Ser Gin Thr Tyr Gly Asn Gin Ala Tyr Ala 
45 50 55 60 

tgg gag egg tta gat ggt ate ttt age ttt aag gac tgg tec cac ccc 
Trp Glu Arg Leu Asp Gly He Phe Ser Phe Lys Asp Trp Ser His Pro 

65 70 75 

ttc cac eta gtc gaa acg gtg ate caa aca gtg gaa gcc tac ata gaa 
Phe His Leu Val Glu Thr Val lie Gin Thr Val Glu Ala Tyr He Glu 

80 85 90 

tec ttg tec ctg cct tta aaa agt tac ggg att cag ate aag age cag 
Ser Leu Ser Leu Pro Leu Lys Ser Tyr Gly He Gin He Lys Ser Gin 
95 100 105 

ttg gac tac cag ggc aaa aaa att ggc ctg ggg tct agt ggg gcc gtt 
Leu Asp Tyr Gin Gly Lys Lys He Gly Leu Gly Ser Ser Gly Ala Val 
110 115 120 

acc ate get gtt ate cga ggc ctg age ctt ctt tac gac etc cac tta 
Thr He Ala Val lie Arg Gly Leu Ser Leu Leu Tyr Asp Leu His Leu 
125 130 135 140 

aaa gac ata gac att ttc aaa eta get gcc ate gcc cat ate cag eta 
Lys Asp He Asp He Phe Lys Leu Ala Ala He Ala His He Gin Leu 

145 150 155 



99 



147 



195 



243 



291 



339 



387 



43 5 



483 



aag age aag ggg tct ttt ggc gat ttg gca gcc tgc act tat act ggt 531 
Lys Ser Lys Gly Ser Phe Gly Asp Leu Ala Ala Cys Thr Tyr Thr Gly 

160 165 170 



579 



627 



gtg ate cgc tac cag tec ctg gat aga gaa tgg tta caa gaa caa ate 
Val He Arg Tyr Gin Ser Leu Asp Arg Glu Trp Leu Gin Glu Gin He 
175 180 185 

tec aac cat tec ate aag gac etc ctg gcc atg gat tgg cct age eta 
Ser Asn His Ser He Lys Asp Leu Leu Ala Met Asp Trp Pro Ser Leu 
190 195 200 

ggt eta gac egg etc age ctg ccc cat gac etc agg ctt tta ate gga 
Gly Leu Asp Arg Leu Ser Leu Pro His Asp Leu Arg Leu Leu He Gly 
205 ~ ~ 210 215 220 

tgg acc ggc cag cct gcc tec aca gaa aaa ttg gtt cag get gtc tac 723 
Trp Thr Gly Gin Pro Ala Ser Thr Glu Lys Leu Val Gin Ala Val Tyr 



675 
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225 230 235 

ccc caa aaa ata acc agg acc ccc ttg gac ttc cag tec ttc tta gac 
Pro Gin Lys He Thr Arg Thr Pro Leu Asp Phe Gin Ser Phe Leu Asp 

240 245 250 

caa tec caa gag tgt gtc gac ggc ttg gtg gag tct tta age cag get 
Gin Ser Gin. Glu Cys Val Asp Gly Leu Val Glu Ser Leu Ser Gin Ala 
255 260 265 

» 

gac tec cag gca age tta get tgg ate caa aag aac cga acc etc etc 
Asp Ser Gin Ala Ser Leu Ala Trp He Gin Lys Asn Arg Thr Leu Leu 
270 275 280 



acc tac ttg tgc gat att gtc gcg aaa tac gga ggc caa gee aag tct 
Thr Tyr Leu Cys Asp He Val Ala Lys Tyr Gly Gly Gin Ala Lys Ser 

305 310 315 



age cca ata gaa gee ate tac egg gaa tgg atg gat gca ggt ate ttg 
Ser Pro He Glu Ala lie Tyr Arg Glu Trp Met Asp Ala Gly He Leu 
335 340 345 

ccc tta aga eta gac att gta gaa aat ggt get tgc tat gac taa 
Pro Leu Arg Leu Asp He Val Glu Asn Gly Ala Cys Tyr Asp 
350 355 360 



771 



819 



867 



aag gca atg ggc caa age egg ggg aaa gtc ate gaa acc aaa gee ttg 915 
Lys Ala Met Gly Gin Ser Arg Gly Lys Val He Glu Thr Lys Ala Leu 
285 290 295 300 



963 



tec ggt gee ggc ggt gga gat tgt ggc att ggc eta ate aca agg gag 1011 
Ser Gly Ala Gly Gly Gly Asp Cys Gly He Gly Leu He Thr Arg Glu 

320 325 330 



1059 



1104 



<210> 42 
<211> 362 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 42 

Met Val Tyr Ser Leu Arg He Pro Gly Lys Leu Tyr Leu Ala Gly Glu 
15 10 15 



Tyr Ala Val Val Thr Pro Gly Tyr Ala Gly He Leu Leu Thr Val Ser 

20 25 30 

Arg Tyr Leu Thr Leu Asp He Trp Glu Thr Ser Pro Asp Gin Ala Ser 
35 40 45 

Val Arg Ser Gin Thr Tyr Gly Asn Gin Ala Tyr Ala Trp Glu Arg Leu 
50 55 60 
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Asp Gly lie Phe Ser Phe Lys Asp Trp Ser His Pro Phe His Leu Val 
65 ^ 70 75 80 



Glu Thr Val lie Gin Thr Val Glu Ala Tyr lie Glu Ser Leu Ser Leu 

85 90 95 



Pro Leu Lys Ser Tyr Gly lie Gin lie Lys Ser Gin Leu Asp Tyr, Gin 

100 105 110 



Gly Lys Lys He Gly Leu Gly Ser Ser Gly Ala Val Thr He Ala Val 
115 120 125 



He Arg Gly Leu Ser Leu Leu Tyr Asp Leu His Leu Lys Asp He Asp 
130 135 140 



lie Phe Lys Leu Ala Ala He Ala His He Gin Leu Lys Ser Lys Gly 
145 150 155 160 



Ser Phe Gly Asp Leu Ala Ala Cys Thr Tyr Thr Gly Val He Arg Tyr 

165 170 175 



Gin Ser Leu Asp Arg Glu Trp Leu Gin Glu Gin He Ser Asn His Ser 

180 185 190 



He Lys Asp Leu Leu Ala Met Asp Trp Pro Ser Leu Gly Leu Asp Arg 
195 200 205 



Leu Ser Leu Pro His Asp Leu Arg Leu Leu He Gly Trp Thr Gly Gin 
210 215 220 



Pro Ala Ser Thr Glu Lys Leu Val Gin Ala Val Tyr Pro Gin Lys He 
225 230 235 240 



Thr Arg Thr Pro Leu Asp Phe Gin Ser Phe Leu Asp Gin Ser Gin Glu 

245 250 255 



Cys Val Asp Gly Leu Val Glu Ser Leu Ser Gin Ala Asp Ser Gin Ala 

260 265 270 



Ser Leu Ala Trp He Gin Lys Asn Arg Thr Leu Leu Lys Ala Met Gly 
275 280 285 



Gin Ser Arg Gly Lys Val He Glu Thr Lys Ala Leu Thr Tyr Leu Cys 
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290 295 300 

Asp He Val Ala Lys Tyr Gly Gly Gin Ala Lys Ser Ser Gly Ala Gly 
305 310 315 320 

Gly Gly Asp Cys Gly He Gly Leu He Thr Arg Glu Ser Pro He Glu 

325 330 335 

Ala He Tyr Arg Glu Trp Met Asp Ala Gly He Leu Pro Leu Arg Leu 

340 345 350 



Asp He Val Glu Asn Gly Ala Cys Tyr Asp 
355 360 



<210> 43 
<211> 1023 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (1023) 

<223> 

<400> 43 

gagaagccaa cc atg act aag cag gcc ttt gaa aag aaa aag tta ggc egg bi 

Met Thr Lys Gin Ala Phe Glu Lys Lys Lys Leu Gly Arg 
1 5 10 - 



att tgc egg gcc cat acc aac att gcc ttg ate aag tac tgg ggt aag 
He Cys Arg Ala His Thr Asn He Ala Leu He Lys Tyr Trp Gly Lys 
15 20 25 

get gat agg gac ttg att ate ccc aat aac aac tec eta tct tta acc 
Ala Asp Arg Asp Leu He He Pro Asn Asn Asn Ser Leu Ser Leu Thr 
30 35 40 45 

ttg gac get ttt tat acc gat acc cag gta gtt ttt gac cca gac ttg 
Leu Asp Ala Phe Tyr Thr Asp Thr Gin Val Val Phe Asp Pro Asp Leu 

50 55 60 

gac cag gac caa tta tgg eta gac ggg aaa cag gaa aaa ggg tec gcc 
Asp Gin Asp Gin Leu Trp Leu Asp Gly Lys Gin Glu Lys Gly Ser Ala 

65 70 75 

tta acc aag gcc cag gtc ate ctg gac ttg gtt egg gac caa gcc cag 
Leu Thr Lys Ala Gin Val He Leu Asp Leu Val Arg Asp Gin Ala Gin 
80 85 90 

ctt gac tgg ccg gcc aaa att acc age cac aac caa gtt gcc act gca 
Leu Asp Trp Pro Ala Lys He Thr Ser His Asn Gin Val Ala Thr Ala 
95 100 105 



99 



147 



195 



243 



291 



339 
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get ggc ttg get tec tct get tct ggt ctg gee gee ttg gcg ggt get 

Ala Gly Leu Ala Ser Ser Ala Ser Gly Leu Ala Ala Leu Ala Gly Ala 

110 115 120 125 

tea get gat get tta gac ctt ggc eta tec cca act gac etc tec cga 

Ser Ala Asp Ala Leu Asp Leu Gly Leu Ser Pro Thr Asp Leu Ser Arg 

130 135 140 

ttg gee cgc agg gga tct ggg tct gee tea cga agt att ttt ggt ggt 

Leu Ala Arg Arg Gly Ser Gly Ser Ala Ser Arg Ser He Phe Gly Gly 

145 150 155 



387 



ccc ate gac ttg gee cag tgg gat att gee atg etc ttt gtc att gta 
Pro lie Asp Leu Ala Gin Trp Asp He Ala Met Leu Phe Val He Val 
175 180 185 

age gac cga cca aag gca att tec tec age caa ggc atg caa ttg ace 
Ser Asp Arg Pro Lys Ala He Ser Ser Ser Gin Gly Met Gin Leu Thr 
190 195 200 205 



gac eta gca gac ate aag tec get ate caa gee caa gac etc gac cag 
Asp Leu Ala Asp He Lys Ser Ala He Gin Ala Gin Asp Leu Asp Gin 

225 230 235 

gtt ggg tec att gca gaa aga aat gee ctg aaa atg cat gec acc aac 
Val Gly Ser He Ala Glu Arg Asn Ala Leu Lys Met His Ala Thr Asn 
240 245 250 

ctg gca gee aag ccc ccc ttc acc tat tgg act aaa gaa agt tta gee 
Leu Ala Ala Lys Pro Pro Phe Thr Tyr Trp Thr Lys Glu Ser Leu Ala 
255 260 265 

ctg atg cag gaa gta tgg gac egg cgc aag get ggc cag tec etc tac 
Leu Met Gin Glu Val Trp Asp Arg Arg Lys Ala Gly Gin Ser Leu Tyr 
270 275 280 285 



gac ctt aaa gee ttc aaa gca gac etc age caa gac tgg ccc gac aag 
Asp Leu Lys Ala Phe Lys Ala Asp Leu Ser Gin Asp Trp Pro Asp Lys 

3.05 310 315 

cat ctt gtc tta get aaa ccg ggt cca ggc ctg gee ttt att gat gga 
His Leu Val Leu Ala Lys Pro Gly Pro Gly Leu Ala Phe He Asp Gly 
320 325 330 



435 



483 



ttt gtc gag tgg gaa aag ggt cat gat gat age tct tec ttt gee aag 531 
Phe Val Glu Trp Glu Lys Gly His Asp Asp Ser Ser Ser Phe Ala Lys 
160 ^ 165 170 



579 



627 



cag gag acg teg gac ttt tac cag gec tgg tta gac age ctg gac caa 675 
Gin Glu Thr Ser Asp Phe Tyr Gin Ala Trp Leu Asp Ser Leu Asp Gin 

210 215 220 



723 



771 



819 



867 



ttc acc atg gac gee ggc ccc aat gtc aag gtt att ggc agg gaa get 915 
Phe Thr Met Asp Ala Gly Pro Asn Val Lys Val He Gly Arg Glu Ala 

290 295 300 



963 



1011 
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cct ttg aac tag 
Pro Leu Asn 
335 



<210> 44 
<211> 336 
<212> PRT 

<213> Alloiococcus otitidis 

<400> 44 . ^ _ 

Met Thr Lys Gin Ala Phe Glu Lys Lys Lys Leu Gly Arg He Cys Arg 
1 5 10 15 

Ala His Thr Asn He Ala Leu He Lys Tyr Trp Gly Lys Ala Asp Arg 

20 25 30 

Asp Leu He He Pro Asn Asn Asn Ser Leu Ser Leu Thr Leu Asp Ala 
35 40 45 

Phe Tyr Thr Asp Thr Gin Val Val Phe Asp Pro Asp Leu Asp Gin Asp 
50 55 60 

Gin Leu Trp Leu Asp Gly Lys Gin Glu Lys Gly Ser Ala Leu Thr Lys 
65 70 75 80 

Ala Gin Val He Leu Asp Leu Val Arg Asp Gin Ala Gin Leu Asp Trp 

85 90 95 

Pro Ala Lys He Thr Ser His Asn Gin Val Ala Thr Ala Ala Gly Leu 

100 105 HO 

Ala Ser Ser Ala Ser Gly Leu Ala Ala Leu Ala Gly Ala Ser Ala Asp 
115 120 125 

Ala Leu Asp Leu Gly Leu Ser Pro Thr Asp Leu Ser Arg Leu Ala Arg 
130 135 140 

Arg Gly Ser Gly Ser Ala Ser Arg Ser He Phe Gly Gly Phe Val Glu 
145 150 155 160 

Trp Glu Lys Gly His Asp Asp Ser Ser Ser Phe Ala Lys Pro He Asp 

165 170 175 

Leu Ala Gin Trp Asp He Ala Met Leu Phe Val He Val Ser Asp Arg 

180 185 190 



1023 
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Pro Lys Ala lie Ser Ser Ser Gin Gly Met Gin Leu Thr Gin Glu Thr 
195 200 205 



Ser Asp Phe Tyr Gin Ala Trp Leu Asp Ser Leu Asp Gin Asp Leu Ala 
210 215 220 



Asp He Lys Ser Ala He Gin Ala Gin Asp Leu Asp Gin Val Gly Ser 
225 230 235 240 



He Ala Glu Arg Asn Ala Leu Lys Met His Ala Thr Asn Leu Ala Ala 

245 250 255 



Lys Pro Pro Phe Thr Tyr Trp Thr Lys Glu Ser Leu Ala Leu Met Gin 

260 265 270 



Glu Val Trp Asp Arg Arg Lys Ala Gly Gin Ser Leu Tyr Phe Thr Met 
275 280 285 



Asp Ala Gly Pro Asn Val Lys Val He Gly Arg Glu Ala Asp Leu Lys 
290 295 300 



Ala Phe Lys Ala Asp Leu Ser Gin Asp Trp Pro Asp Lys His Leu Val 
305 310 315 320 



Leu Ala Lys Pro Gly Pro Gly Leu Ala Phe He Asp Gly Pro Leu Asn 

325 330 335 



<210> 45 
<211> 981 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (28) . . (981) 

<223> 

<400> 45 

acaaaaatag acaaaggaga caaaagg atg acg ctt gtt aaa aat gta gcc aaa 54 

Met Thr Leu Val Lys Asn Val Ala Lys 
1 5 



ggc act gcc cat ggt aaa att att tta ate ggt gag cat get gtt gtc 



102 
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Gly Thr Ala His Gly Lys lie lie Leu lie Gly Glu His Ala Val Val 
10 15 20 25 

tat aac atg ccg gcc ate gec etc cct ttt ace aca gee ace ate ace 
Tyr Asn Met Pro Ala lie Ala Leu Pro Phe Thr Thr Ala Thr lie Thr 

30 35 40 

gtt gaa gtt agt cct tac caa ggc aaa age tat eta gaa agt get tgc 
Val Glu Val Ser Pro Tyr Gin Gly Lys Ser Tyr Leu Glu Ser Ala Cys 

45 50 55 

tac tgc gga tct tta gac caa gcg ccc ggg gac ttg gca ggg ctt caa 
Tyr Cys Gly Ser Leu Asp Gin Ala Pro Gly Asp Leu Ala Gly Leu Gin 
60 65 70 

gcc tgt ttg aca gcg gtt tgt gcc gac tta gac cag tec age gac cac 
Ala Cys Leu Thr Ala Val Cys Ala Asp Leu Asp Gin Ser Ser Asp His 
75 80 85 

ttg tat ate aag gtc gac age atg ate cct get gaa aga gga atg ggg 
Leu Tyr lie Lys Val Asp Ser Met He Pro Ala Glu Arg Gly Met Gly 
90 95 100 105 

tec agt get get gtg gcc ace gcc tta gtc aag gcc etc ttt cac tac 
Ser Ser Ala Ala Val Ala Thr Ala Leu Val Lys Ala Leu Phe His Tyr 

110 115 120 

ttc caa gtc gac tta age agt gaa gcc etc tea gcc tat gtc gag att 
Phe Gin Val Asp Leu Ser Ser Glu Ala Leu Ser Ala Tyr Val Glu He 

125 130 I 35 

gcc gaa aaa att acc cat ggc aag cca teg ggt ctg gat get aca gtc 
Ala Glu Lys He Thr His Gly Lys Pro Ser Gly Leu Asp Ala Thr Val 
140 145 150 

gtc aac tec att gcc ccc gtt tat ttt aaa cgc aac cag ctt ccc aag 
Val Asn Ser He Ala Pro Val Tyr Phe Lys Arg Asn Gin Leu Pro Lys 
155 160 165 

gcc ate cct tta aat gtt gac ggc tat tta att gca gcc gat act ggg 
Ala He Pro Leu Asn Val Asp Gly Tyr Leu He Ala Ala Asp Thr Gly 
170 175 180 185 

att aag ggc cac acg aaa gaa gcc gtt ggg gat gtg gcg aag ctg gtt 
He Lys Gly His Thr Lys Glu Ala Val Gly Asp Val Ala Lys Leu Val 

190 195 200 

gaa act gcc aag gtt caa acc atg gac att gtc cac cac etc ggc cag 
Glu Thr Ala Lys Val Gin Thr Met Asp He Val His His Leu Gly Gin 

205 210 215 

ctt acc cac cag get aaa aaa gca ate atg acc aat aac etc cct ggc 
Leu Thr His Gin Ala Lys Lys Ala He Met Thr Asn Asn Leu Pro Gly 
220 225 230 

tta ggg gag att ttg aac cag tec cac caa etc tta aag gat tta act 
Leu Gly Glu He Leu Asn Gin Ser His Gin Leu Leu Lys Asp Leu Thr 



150 



198 



246 



294 



342 



390 



438 



486 



534 



582 



630 



678 



726 



774 
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235 240 245 

gtc age aat ccc aag tta gac caa ctt gtc caa gca gec caa gat get 
Val Ser Asn Pro Lys Leu Asp Gin Leu Val Gin Ala Ala Gin Asp Ala 
250 255 260 265 

gga get tgc gga get aag tta ace ggt ggg ggc egg ggt ggt tgc atg 
Gly Ala Cys Gly Ala Lys Leu Thr Gly Gly Gly Arg Gly Gly Cys Met 

270 275 280 

att gee eta gee caa age aac cag gat gee tec aat att gee caa aaa 
lie Ala Leu Ala Gin Ser Asn Gin Asp Ala Ser Asn lie Ala Gin Lys 

285 290 295 

ttg gaa aaa gcg gga gee att gaa acc tgg ate cac ccc tta gga gaa 
Leu Glu Lys Ala Gly Ala He Glu Thr Trp He His Pro Leu Gly Glu 
300 305 310 

gee aac cat gac taa 
Ala Asn His Asp 
315 



<210> 46 
<211> 317 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 46 

Met Thr Leu Val Lys Asn Val Ala Lys Gly Thr Ala His Gly Lys lie 
X 5 10 15 

He Leu He Gly Glu His Ala Val Val Tyr Asn Met Pro Ala He Ala 

20 25 30 

Leu Pro Phe Thr Thr Ala Thr He Thr Val Glu Val Ser Pro Tyr Gin 
35 40 45 

Gly Lys Ser Tyr Leu Glu Ser Ala Cys Tyr Cys Gly Ser Leu Asp Gin 
50 . 55 60 

Ala Pro Gly Asp Leu Ala Gly Leu Gin Ala Cys Leu Thr Ala Val Cys 
65 70 75 80 

Ala Asp Leu Asp Gin Ser Ser Asp His Leu Tyr He Lys Val Asp Ser 

85 90 95 



822 



870 



918 



966 



981 



Met He Pro Ala Glu Arg Gly Met Gly Ser Ser Ala Ala Val Ala Thr 

100 105 110 
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Ala Leu Val Lys Ala Leu Phe His Tyr Phe Gin Val Asp Leu Ser Ser 
115 120 125 



Glu Ala Leu Ser Ala Tyr Val Glu lie Ala Glu Lys lie Thr His Gly 
130 135 140 



Lys Pro Ser Gly Leu Asp Ala Thr Val Val Asn Ser lie Ala Pro Val 
145 150 155 160 



Tyr Phe Lys Arg Asn Gin Leu Pro Lys Ala lie Pro Leu Asn Val Asp 

165 170 175 



Gly Tyr Leu lie Ala Ala Asp Thr Gly lie Lys Gly His Thr Lys Glu 

180 185 190 



Ala Val Gly Asp Val Ala Lys Leu Val Glu Thr Ala Lys Val Gin Thr 
195 200 205 



Met Asp He Val His His Leu Gly Gin Leu Thr His Gin Ala Lys Lys 
210 215 220 



Ala He Met Thr Asn Asn Leu Pro Gly Leu Gly Glu He Leu Asn Gin 
225 230 235 240 



Ser His Gin Leu Leu Lys Asp Leu Thr Val Ser Asn Pro Lys Leu Asp 

245 250 255 



Gin Leu Val Gin Ala Ala Gin Asp Ala Gly Ala Cys Gly Ala Lys Leu 

260 265 270 



Thr Gly Gly Gly Arg Gly Gly -Cys Met He Ala Leu Ala Gin Ser Asn 
275 280 285 



Gin Asp Ala Ser Asn He Ala Gin Lys Leu Glu Lys Ala Gly Ala He 
290 295 300 



Glu Thr Trp He His Pro Leu Gly Glu Ala Asn His Asp 
305 310 315 



<210> 47 
<211> 975 
<212> DNA 

<213> Alloiococcus otitidis 
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<220> 

<221> CDS 

<222> (46) . . (975) 

<223> 

<400> 47 

agaatcaaat ttgttttaaa attatcagct tttggaggtc agaac atg aac aat tec 57 

Met Asn Asn Ser 
1 



cgt att ttt tta gtc tat gac cgt aaa gac tgg cag tct ctt aga gaa 
Arg He Phe Leu Val Tyr Asp Arg Lys Asp Trp Gin Ser Leu Arg Glu 
5 10 15 20 



gtg aat gac gtc ata teg atg gaa gat gtc cga gaa gtt tac gtc ccc 

Val Asn Asp Val He Ser Met Glu Asp Val Arg Glu Val Tyr Val Pro 

40 45 50 

att ate caa tta ctg gat gtc tac ata aaa agt tac tac cgc cac cag 

He He Gin Leu Leu Asp Val Tyr He Lys Ser Tyr Tyr Arg His Gin 
55 60 65 



aag gta gac etc etc aca aca gat ggc ttc ctt tat ccg aat aag att 
Lys Val Asp Leu Leu Thr Thr Asp Gly Phe Leu Tyr Pro Asn Lys He 

120 125 130 



105 



aat gee age ctt tct tta acg gaa aaa aac eta aat aac ttg cgt gca 153 
Asn Ala Ser Leu Ser Leu Thr Glu Lys Asn Leu Asn Asn Leu Arg Ala 

25 30 35 



201 



249 



get tec ttg ate aat tac ttg aac ctg gac cag cct aaa aag tac caa 297 

Ala Ser Leu He Asn Tyr Leu Asn Leu Asp Gin Pro Lys Lys Tyr Gin 
70 75 80 

ccc tat gtg att ggg att gca ggg age gtg get gtg ggc aag tct acg 345 

Pro Tyr Val He Gly He Ala Gly Ser Val Ala Val Gly Lys Ser Thr 
85 90 95 100 

gtt gee agg ctt ctt aag tec etc ttg age gac tac tat ccg gaa aaa 3 93 

Val Ala Arg Leu Leu Lys Ser Leu Leu Ser Asp Tyr Tyr Pro Glu Lys 

105 110 115 



441 



tta aaa gag cga gat ate atg gac cgc aag ggt ttt ccc gaa age tat 489 
Leu Lys Glu Arg Asp He Met Asp Arg Lys Gly Phe Pro Glu Ser Tyr 
135 140 145 

gat atg aaa cgt ttg att aac ttt atg ace gat gtc aaa aat aat gtt 537 
Asp Met Lys Arg Leu He Asn Phe Met Thr Asp Val Lys Asn Asn Val 
150 155 160 

ccc aac ate cag gtg ccc aag tat tec cac caa gtt tac gac ata gta 585 
Pro Asn He Gin Val Pro Lys Tyr Ser His Gin Val Tyr Asp He Val 
165 170 175 180 

gaa ggg gaa agg ttg ace att aac cag cca gac ate ttg att gtc gaa 633 
Glu Gly Glu Arg Leu Thr lie Asn Gin Pro Asp He Leu He Val Glu 

185 190 195 
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ggg ate aat gtg etc caa ctt cct tct aat gag aag att ttt gtt age 
Gly He Asn Val Leu Gin Leu Pro Ser Asn Glu Lys He Phe Val Ser 

200 205 210 

gat ttt ttc gac ttc tec ttt tat gtg gat gee tea gaa aat ctg att 
Asp Phe Phe Asp Phe Ser Phe Tyr Val Asp Ala Ser Glu Asn Leu He 
215 220 225 

gaa aaa tgg tac atg caa cgc ttt ggc acc ttt atg gat acc gee ttc 
Glu Lys Trp Tyr Met Gin Arg Phe Gly Thr Phe Met Asp Thr Ala Phe 
230 235 240 

caa gac ccc aac aac tat tac tac aag ttt aat gac tgg gac cgc aag 
Gin Asp Pro Asn Asn Tyr Tyr Tyr Lys Phe Asn Asp Trp Asp Arg Lys 
245 ~ 250 255 260 

gaa get ttt gee tat gee aac caa gtt tgg gaa acg gtt aac eta gaa 
Glu Ala Phe Ala Tyr Ala Asn Gin Val Trp Glu Thr Val Asn Leu Glu 

265 270 275 

aac etc agg gaa tat att eta ccc acc cga etc egg get aac etc ate 
Asn Leu Arg Glu Tyr He Leu Pro Thr Arg Leu Arg Ala Asn Leu He 

280 285 290 

etc cat aaa acc cat aac cac tac ate gac aag att tta etc aaa aaa 
Leu His Lys Thr His Asn His Tyr He Asp Lys He Leu Leu Lys Lys 
295 300 305 

cac tga 
His 



<210> 48 
<211> 309 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 48 

Met Asn Asn Ser Arg He Phe Leu Val Tyr Asp Arg Lys Asp Trp Gin 
1 5 10 15 



Ser Leu Arg Glu Asn Ala Ser Leu Ser Leu Thr Glu Lys Asn Leu Asn 

20 25 30 



Asn Leu Arg Ala Val Asn Asp Val He Ser Met Glu Asp Val Arg Glu 
35 40 45 



Val Tyr- Val Pro He lie Gin Leu Leu Asp Val Tyr He Lys Ser Tyr 
50 55 60 



681 



729 



777 



825 



873 



921 



969 



975 



Tyr Arg His Gin Ala Ser Leu He Asn Tyr Leu Asn Leu Asp Gin Pro 
65 70 75 80 
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Lys Lys Tyr Gin Pro Tyr Val lie Gly lie Ala Gly Ser Val Ala Val 

85 90 95 



Gly Lys Ser Thr Val Ala Arg Leu Leu Lys Ser Leu Leu Ser Asp Tyr 

100 105 110 

Tyr Pro Glu Lys Lys Val Asp Leu Leu Thr Thr Asp Gly Phe Leu Tyr 
115 120 125 

Pro Asn Lys He Leu Lys Glu Arg Asp He Met Asp Arg Lys Gly Phe 
130 135 140 

Pro Glu Ser Tyr Asp Met Lys Arg Leu He Asn Phe Met Thr Asp Val 
145 150 155 160 

Lys Asn Asn Val Pro Asn lie Gin Val Pro Lys Tyr Ser His Gin Val 

165 170 175 

Tyr Asp He Val Glu Gly Glu Arg Leu Thr He Asn Gin Pro Asp He 

180 185 190 

Leu He Val Glu Gly He Asn Val Leu Gin Leu Pro Ser Asn Glu Lys 
195 200 205 

He Phe Val Ser Asp Phe Phe Asp Phe Ser Phe Tyr Val Asp Ala Ser 
210 215 220 

Glu Asn Leu He Glu Lys Trp Tyr Met Gin Arg Phe Gly Thr Phe Met 
225 230 235 240 

Asp Thr Ala Phe Gin Asp Pro Asn Asn Tyr Tyr Tyr Lys Phe Asn Asp 

245 250 255 

Trp Asp Arg Lys Glu Ala Phe Ala Tyr Ala Asn Gin Val Trp Glu Thr 

260 265 270 

Val Asn Leu Glu Asn Leu Arg Glu Tyr He Leu Pro Thr Arg Leu Arg 
275 280 285 



Ala Asn Leu He Leu His Lys Thr His Asn His Tyr He Asp Lys He 
290 295 300 
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Leu Leu Lys Lys His 
305 



<210> 49 
<211> 846 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (846) 

<223> 

<400> 49 

agctta atg gga gac gat tta aga gaa gaa att ctt gac cga atg aag 

Met Gly Asp Asp Leu Arg Glu Glu lie Leu Asp Arg Met Lys 
1 5 10 

. gtc caa gcc caa att aac ccc aat gag gaa att cgc egg acc att gac 
Val Gin Ala Gin lie Asn Pro Asn Glu Glu lie Arg Arg Thr lie Asp 
15 20 25 30 

ttt ate aag gac tat etc cag gcc cac ccc ttc ttt gaa tec tta ate 
Phe lie Lys Asp Tyr Leu Gin Ala His Pro Phe Phe Glu Ser Leu lie 

35 40 45 

ttg ggc ate tec ggt ggc cag gat tec acc etc ctg ggt aag eta gcc 
Leu Gly lie Ser Gly Gly Gin Asp Ser Thr Leu Leu Gly Lys Leu Ala 

50 55 60 

cag atg gcc tgc ctt gaa ctg agg gaa gag gag ggg tct gac aag cca 
Gin Met Ala Cys Leu Glu Leu Arg Glu Glu Glu Gly Ser Asp Lys Pro 
65 70 75 

att ttt att ggt ate cgc eta cct tat ggg gat caa ttt gat gaa gca 
He Phe He Gly lie Arg Leu Pro Tyr Gly Asp Gin Phe Asp Glu Ala 
80 85 90 

gaa gcc cag caa gcc etc aat tgg ate cag cct gac cag get ctg acc 
Glu Ala Gin Gin Ala Leu Asn Trp He Gin Pro Asp Gin Ala Leu Thr 
95 100 105 110 

att aat ate aaa gag tec gtt gat ggc ctg gtt gac act ttg gcc ggc 
He Asn lie Lys Glu Ser Val Asp Gly Leu Val Asp Thr Leu Ala Gly 

115 120 125 

caa ggc att gaa gtt tct gac ttt aac aag ggc aat ate aaa get egg 
Gin Gly He Glu Val Ser Asp Phe Asn Lys Gly Asn He Lys Ala Arg 

130 135 140 

ate cga atg gtg gcc caa tat ggc gta gcg ggt cac ttc cac ggg gcg 
He Arg Met Val Ala Gin Tyr Gly Val Ala Gly His Phe His Gly Ala 
145 150 155 

gtg tta gga tct gac cat tea gcc gaa aat gta act ggc ttt ttc acc 
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Val Leu Gly Ser Asp His Ser Ala Glu Asn Val Thr Gly Phe Phe Thr 
160 165 170 

aag cat ggg gac ggc get agt gac etc aac cct ctt ttc cgc eta aat 576 
Lys His Gly Asp Gly Ala Ser Asp Leu Asn Pro Leu Phe Arg Leu Asn 
175 180 185 190 

aaa cgt cag gga egg gee ctg ctt gag gaa tta ggg tec cct aag aac 624 
Lys Arg Gin Gly Arg Ala Leu Leu Glu Glu Leu Gly Ser Pro Lys Asn 

195 200 205 

ttg tac caa aag ace ccc aca get gat ttg gaa gaa gac cag ccc ggc 672 
Leu Tyr Gin Lys Thr Pro Thr Ala Asp Leu Glu Glu Asp Gin Pro Gly 

210 215 220 

ttg tea gat gaa gac aag tta ggg gtt tct tat gaa gee att gat gac 720 
Leu Ser Asp Glu Asp Lys Leu Gly Val Ser Tyr Glu Ala lie Asp Asp 
225 230 235 

tac ttg gag ggc aag cca gtt age cag gag gac cag gca ace ate gaa 768 
Tyr Leu Glu Gly Lys Pro Val Ser Gin Glu Asp Gin Ala Thx lie Glu 
240 245 250 

aaa tgg tat caa caa acg gee cac aag cgc cac ttg ccg gtg act ate 816 
Lys Trp Tyr Gin Gin Thr Ala His Lys Arg His Leu Pro Val Thr lie 
255 260 265 270 

ttt gat gat ttt tgg aaa gaa aaa aat tag 846 
Phe Asp Asp Phe Trp Lys Glu Lys Asn 

275 



<210> 50 
<211> 279 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 50 

Met Gly Asp Asp Leu Arg Glu Glu lie Leu Asp Arg Met Lys Val Gin 
15 10 15 



Ala Gin lie Asn Pro Asn Glu Glu lie Arg Arg Thr He Asp Phe He 

20 25 30 



Lys Asp Tyr Leu Gin Ala His Pro Phe Phe Glu Ser Leu He Leu Gly 
35 40 45 



lie Ser Gly Gly Gin Asp Ser Thr Leu Leu Gly Lys Leu Ala Gin Met 
50 55 60 



Ala Cys Leu Glu Leu Arg Glu Glu Glu Gly Ser Asp Lys Pro He Phe 
65 "* 70 75 80 
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He Gly lie Arg Leu Pro Tyr Gly Asp Gin Phe Asp Glu Ala Glu Ala 

85 90 95 



Gin Gin Ala Leu Asn Trp He Gin Pro Asp Gin Ala Leu Thr He Asn 

100 105 110 

He Lys Glu Ser Val Asp Gly Leu Val Asp Thr Leu Ala Gly Gin Gly 
115 120 125 

He Glu Val Ser Asp Phe Asn Lys Gly Asn He Lys Ala Arg He Arg 
130 135 140 

Met Val Ala Gin Tyr Gly Val Ala Gly His Phe His Gly Ala Val Leu 
145 150 155 160 

Gly Ser Asp His Ser Ala Glu Asn Val Thr Gly Phe Phe Thr Lys His 

165 170 175 



Gly Asp Gly Ala Ser Asp Leu Asn Pro Leu Phe Arg Leu Asn Lys Arg 

180 185 190 



Gin Gly Arg Ala Leu Leu Glu Glu Leu Gly Ser Pro Lys Asn Leu Tyr 
195 200 205 

Gin Lys Thr Pro Thr Ala Asp Leu Glu Glu Asp Gin Pro Gly Leu Ser 
210 215 220 

Asp Glu Asp Lys Leu Gly Val Ser Tyr Glu Ala He Asp Asp Tyr Leu 
225 230 235 240 

Glu Gly Lys Pro Val Ser Gin Glu Asp Gin Ala Thr He Glu Lys Trp 

245 250 255 



Tyr Gin Gin Thr Ala His Lys Arg His Leu Pro Val Thr He Phe Asp 

260 265 270 



Asp Phe Trp Lys Glu Lys Asn 
275 



<210> 51 
<211> 843 
<212> DNA 

<213> Alloiococcus otitidis 
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<220> 

<221> CDS 

<222> (7) . . (843) 

<223> 

<400> 51 

aggaac atg att atg tat act gat ggg att ggc ttt att gat tea gga 48 

Met lie Met Tyr Thr Asp Gly lie Gly Phe lie Asp Ser Gly 
15 10 

gtg ggt ggc ttc acc ctg gtc aaa gaa gec atg aag caa ttg cca aat 96 

Val Gly Gly Phe Thr Leu Val Lys Glu Ala Met Lys Gin Leu Pro Asn 
15 20 25 30 

gaa caa ttt tac tat ctg gga gac acc gec egg tea cct tat gga cct 144 

Glu Gin Phe Tyr Tyr Leu Gly Asp Thr Ala Arg Ser Pro Tyr Gly Pro 

35 40 45 

aaa gac atg gee act gtc aag gca tat gec ttt gaa ctt gec aat tac 192 

Lys Asp Met Ala Thr Val Lys Ala Tyr Ala Phe Glu Leu Ala Asn Tyr 

50 55 60 

ctg gtt aaa aac cac cag ate aaa ate ttg gtg ate get tgt aat act 240 

Leu Val Lys Asn His Gin lie Lys lie Leu Val lie Ala Cys Asn Thr 
65 70 75 

gcg act gtc get gec etc aag gac eta aaa cag gee ttg ccc ate cca 288 

Ala Thr Val Ala Ala Leu Lys Asp Leu Lys Gin Ala Leu Pro lie Pro 
80 85 90 

gtt tta ggg gtc ate tta cct ggt tgc cga gca get att aag get agt 33 6 

Val Leu Gly Val lie Leu Pro Gly Cys Arg Ala Ala He Lys Ala Ser 
95 100 105 110 

gtt aac cat cag att ggg gtt att gee acc cat ggg acc ate cag tec 3 84 

Val Asn His Gin He Gly Val He Ala Thr His Gly Thr He Gin Ser 

115 120 125 

ggt cgc tat gag ctt gaa ctt aaa egg aaa cga ccg gat att gaa gtg 432 

Gly Arg Tyr Glu Leu Glu Leu Lys Arg Lys Arg Pro Asp He Glu Val 

130 135 140 

aca agt ctg get tgt ccc gaa ttt gee ccc atg gta gag gcg gga gac 480 

Thr Ser Leu Ala Cys Pro Glu Phe Ala Pro Met Val Glu Ala Gly Asp 
145 150 155 

tac cga tct gtt caa get age agt gtg gtg agg aca tec tta cag gec 528 

Tyr Arg Ser Val Gin Ala Ser Ser Val Val Arg Thr Ser Leu Gin Ala 
160 165 . 170 

eta gaa gac caa gat ttg gat acc ctt att ttg ggt tgc acc cac tat 576 

Leu Glu Asp Gin Asp Leu Asp Thr Leu He Leu Gly Cys Thr His Tyr 
175 180 185 190 

ccc att ata aaa gac etc att caa gac tct att ggc cct ggt ate age 624 

Pro He He Lys Asp Leu He Gin Asp Ser He Gly Pro Gly He Ser 
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195 200 205 

ttg gtt gat cca ggg gcg gaa get gtg aat gac ttg agt gtc tta tta 
Leu Val Asp Pro Gly Ala Glu Ala Val Asn Asp Leu Ser Val Leu Leu 

210 215 220 

gac tat tat gac ttg act aat gac egg ttt aat ccc aac ctg acc cac 
Asp Tyr Tyr Asp Leu Thr Asn Asp Arg Phe Asn Pro Asn Leu Thr His 
225 230 235 

cat ttt tac acc acg gga gat aaa gec ggg ttt aag aaa ate gcg gat 
His Phe Tyr Thr Thr Gly Asp Lys Ala Gly Phe Lys Lys lie Ala Asp 
240 245 250 

gac tgg ctt gac cac cac aac tac egg gtt gac cat tta gat tta gag 
Asp Trp Leu Asp His His Asn Tyr Arg Val Asp His Leu Asp Leu Glu 
255 260 265 270 

gag ttg.caa gaa gtt aat gga aga taa 
Glu Leu Gin Glu Val Asn Gly Arg 

275 



<210> 52 
<211> 278 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 52 

Met lie Met Tyr Thr Asp Gly He Gly Phe He Asp Ser Gly Val Gly 
15 10 15 

Gly Phe Thr Leu Val Lys Glu Ala Met Lys Gin Leu Pro Asn Glu Gin 

20 25 30 



Phe Tyr Tyr Leu Gly Asp Thr Ala Arg Ser Pro Tyr Gly Pro Lys Asp 
35 40 45 

Met Ala Thr Val Lys Ala Tyr Ala Phe Glu Leu Ala Asn Tyr Leu Val 
50 55 60 

Lys Asn His Gin He Lys He Leu Val He Ala Cys Asn Thr Ala Thr 
65 70 75 80 

Val Ala Ala Leu Lys Asp Leu Lys Gin Ala Leu Pro He Pro Val Leu 

85 90 95 



672 



720 



768 



816 



843 



Gly Val He Leu Pro Gly Cys Arg Ala Ala He Lys Ala Ser Val Asn 

100 105 110 
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His Gin He Gly Val He Ala Thr His Gly Thr He Gin Ser Gly Arg 
115 120 125 



Tyr Glu Leu Glu Leu Lys Arg Lys Arg Pro Asp He Glu Val Thr Ser 
130 135 140 



Leu Ala Cys Pro Glu Phe Ala Pro Met Val Glu Ala Gly Asp Tyr Arg 
145 150 155 160 



Ser Val Gin Ala Ser Ser Val Val Arg Thr Ser Leu Gin Ala Leu Glu 

165 170 175 



Asp Gin Asp Leu Asp Thr Leu lie Leu Gly Cys Thr His Tyr Pro He 

180 185 190 



He Lys Asp Leu He Gin Asp Ser He Gly Pro Gly He Ser Leu Val 
195 200 205 



Asp Pro Gly Ala Glu Ala Val Asn Asp Leu Ser Val Leu Leu Asp Tyr 
210 215 220 



Tyr Asp Leu Thr Asn Asp Arg Phe Asn Pro Asn Leu Thr His His Phe 
225 230 235 240 



Tyr Thr Thr Gly Asp Lys Ala Gly Phe Lys Lys He Ala Asp Asp Trp 

245 250 255 



Leu Asp His His Asn Tyr Arg" Val Asp His Leu Asp Leu Glu Glu Leu 

260 265 270 



Gin Glu Val Asn Gly Arg 
275 



<210> 53 
<211> 957 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (957) 

<223> 

<400> 53 

aaaaat atg acg aag gag tct tea ttt atg gtc aag acc aaa ata tgt 48 
Met Thr Lys Glu Ser Ser Phe Met Val Lys Thr Lys He Cys 
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1 5 10 

tct att tta aat ata aca ccg gat tea ttt tct gat ggt ggg cgc aac 96 

Ser lie Leu Asn lie Thr Pro Asp Ser Phe Ser Asp Gly Gly Arg Asn 
15 20 25 30 

tat cag gca gac caa gec ata get cac gga etc gac ttg gta gac aag 144 

Tyr Gin Ala Asp Gin Ala lie Ala His Gly Leu Asp Leu Val Asp Lys 

35 40 45 

gga gcg gac atg ttg gat att gga ggt gag teg ace egg cct ggt tec 192 

Gly Ala Asp Met Leu Asp lie Gly Gly Glu Ser Thr Arg Pro Gly Ser 

50 55 60 

agt cca gtc gac etc caa gat gaa ate gac cgt att gta ccg gtg ate 240 

Ser Pro Val Asp Leu Gin Asp Glu He Asp Arg He Val Pro Val He 
65 70 75 

aag gga ate aga gaa aaa agt cag gtt cct att tea gta gat acc tac 288 

Lys Gly He Arg Glu Lys Ser Gin Val Pro He Ser Val Asp Thr Tyr 
80 85 90 

egg get cca gtt gee aaa gcg get att gat get ggg gcg gat ate ate 336 

Arg Ala Pro Val Ala Lys Ala Ala He Asp Ala Gly Ala Asp He He 
95 100 105 110 

aat gat att acc ggt eta act ggt gat gta gac atg gee gac ttg eta 384 

Asn Asp He Thr Gly Leu Thr Gly Asp Val Asp Met Ala Asp Leu Leu 

115 120 125 

get caa gaa ggg gtt aag gee att gtc atg ttc aac ccg gtt att get 432 

Ala Gin Glu Gly Val Lys Ala He Val Met Phe Asn Pro Val He Ala 

130 135 140 

cga cct gac cac cca tct tec caa aaa ttc aga gat ttc ggg ggc cga 480 

Arg Pro Asp His Pro Ser Ser Gin Lys Phe Arg Asp Phe Gly Gly Arg 
145 150 155 

gat ttt ttc acc gat gaa gaa aga gat aaa atg tec caa gca ccc att 528 

Asp Phe Phe Thr Asp Glu Glu Arg Asp Lys Met Ser Gin Ala Pro He 
160 165 * 170 

gaa gag gec atg atg gtc tac ttt gac aaa gtc ttg aac aag gee cat 576 

Glu Glu Ala Met Met Val Tyr Phe Asp Lys Val Leu Asn Lys Ala His 
175 180 185 190 

caa get ggg att gac egg gat aag att tta ctg gac ccg gga att ggc 624 

Gin Ala Gly He Asp Arg Asp Lys He Leu Leu Asp Pro Gly He Gly 

195 200 205 

ttt ggc ctg acc aag aag gaa aat tac aag ttg att cac agt gtt gee 672 

Phe Gly Leu Thr Lys Lys Glu Asn Tyr Lys Leu He His Ser Val Ala 

210 215 220 

teg att cat gac aag ggc tac ccg gtc ttt tta gga gtt tec cgc aaa 720 

Ser lie His Asp Lys Gly Tyr Pro Val Phe Leu Gly Val Ser Arg Lys 
225 230 "* 235 
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cgc ttc ttg gtg ggg gaa gtc tec aag eta ggc ate gaa gee gac cca 
Arg Phe Leu Val Gly Glu Val Ser Lys Leu Gly He Glu Ala Asp Pro 
240 245 250 

gag ace caa gca gga ttt tta aac cga gac ctg get tea get att att 
Glu Thr Gin Ala Gly Phe Leu Asn Arg Asp Leu Ala Ser Ala He He 
255 260 265 270 



768 



816 



aca get tac get age cat ata ggg gta gac tat gtc egg gtt cat tec 864 
Thr Ala Tyr Ala Ser His lie Gly Val Asp Tyr Val Arg Val His Ser 

275 280 285 

« 

tta gat gaa cac aaa ata gca ace ace att ace cat aat att tta aac 912 
Leu Asp Glu His Lys He Ala Thr Thr He Thr His Asn He Leu Asn 

290 295 300 

age gat age tta gat gat* cag age ttt gac caa tat aaa aat taa 957 
Ser Asp Ser Leu Asp Asp Gin Ser Phe Asp Gin Tyr Lys Asn 
305 310 315 



<210> 54 
<211> 316 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 54 

Met Thr Lys Glu Ser Ser Phe Met Val Lys Thr Lys He Cys Ser He 
15 10 15 



Leu Asn He Thr Pro Asp Ser Phe Ser Asp Gly Gly Arg Asn Tyr Gin 

20 25 30 



Ala Asp Gin Ala He Ala His Gly Leu Asp Leu Val Asp Lys Gly Ala 
35 40 45 



Asp Met Leu Asp He Gly Gly Glu Ser Thr Arg Pro Gly Ser Ser Pro 
50 " 55 60 



Val Asp Leu Gin Asp Glu He Asp Arg He Val Pro Val He Lys Gly 
65 70 75 80 



He Arg Glu Lys Ser Gin Val Pro He Ser Val Asp Thr Tyr Arg Ala 

85 90 95 



Pro Val Ala Lys Ala Ala He Asp Ala Gly Ala Asp He He Asn Asp 

100 105 110 



He Thr Gly Leu Thr Gly Asp Val Asp Met Ala Asp Leu Leu Ala Gin 
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Glu Gly Val Lys Ala lie Val Met Phe Asn Pro Val lie Ala Arg Pro 
130 135 140 



Asp His Pro Ser Ser Gin Lys Phe Arg Asp Phe Gly Gly Arg Asp Phe 
145 150 155 160 



Phe Thr Asp Glu Glu Arg Asp Lys Met Ser Gin Ala Pro He Glu Glu 

165 170 175 



Ala Met Met Val Tyr Phe Asp Lys Val Leu Asn Lys Ala His Gin Ala 

180 185 190 



Gly He Asp Arg Asp Lys He Leu Leu Asp Pro Gly He Gly Phe Gly 
195 200 205 



Leu Thr Lys Lys Glu Asn Tyr Lys Leu He His Ser Val Ala Ser He 
210 215 220 



His Asp Lys Gly Tyr Pro Val Phe Leu Gly Val Ser Arg Lys Arg Phe 
225 " 230 235 240 



Leu Val Gly Glu Val Ser Lys Leu Gly He Glu Ala Asp Pro Glu Thr 

245 250 255 



Gin Ala Gly Phe Leu Asn Arg Asp Leu Ala Ser Ala He He Thr Ala 

260 265 270 



Tyr Ala Ser His He Gly Val Asp Tyr Val Arg Val His Ser Leu Asp 
275 280 285 



Glu His Lys He Ala Thr Thr He Thr His Asn He Leu Asn Ser Asp 
290 295 300 



Ser Leu Asp Asp Gin Ser Phe Asp Gin Tyr Lys Asn 
3 05 310 315 



<210> 55 
<211> 561 
<212> DNA 

<213> Alloiococcus" otitidis 



<220> 
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<221> CDS 

<222> (28) . . (561) 

<223> 

<400> 55 

acaagaacta gacgaaaaat tagaacg ttg gga ata ttt aag cca ata tgt ata 54 

Met Gly lie Phe Lys Pro He Cys He 
1 5 



gga gag ata act atg ata gcc tac gtt tgg gcc caa gat gag caa gga 
Gly Glu He Thr Met He Ala Tyr Val Trp Ala Gin Asp Glu Gin Gly 
10 15 20 25 

ate att ggt aaa gac aag gtt ttg cct tgg gaa ttg tec aat gac tta 
He He Gly Lys Asp Lys Val Leu Pro Trp Glu Leu Ser Asn Asp Leu 

30 35 40 

aag cat ttt aaa aaa gtt aca gaa ggt cac acc ate ctg atg ggc egg 
Lys His Phe Lys Lys Val Thr Glu Gly His Thr He Leu Met Gly Arg 

45 50 55 

aag acc ttt gaa gga atg gat aaa aag ccc etc cct aac cga aaa acc 
Lys Thr Phe Glu Gly Met Asp Lys Lys Pro Leu Pro Asn Arg Lys Thr 
60 65 70 

ttg gta ttg acc cgc caa gat gac tac caa get ggg gac gac cag gtt 
Leu Val Leu Thr Arg Gin Asp Asp Tyr Gin Ala Gly Asp Asp Gin Val 
75 80 85 

gaa gtc gtc cac tec aaa gac cag gcc ttg act tat gcg tea ggt cat 
Glu Val Val His Ser Lys Asp Gin Ala Leu Thr Tyr Ala Ser Gly His 
90 95 100 105 

ggg gtg gac etc tat gtg att ggt ggg gcc ggc att ttc gac ttg ttt 
Gly Val Asp Leu Tyr Val He Gly Gly Ala Gly He Phe Asp Leu Phe 

110 115 120 

ctg gac caa gtt gat gtt etc cac caa aca gtt ate cac gag age ttt 
Leu Asp Gin Val Asp Val Leu His Gin Thr Val He His Glu Ser Phe 

125 130 135 



gtg tct aaa get tat tat gac cag get gac ggt cac aac cac tec cac 
Val Ser Lys Ala Tyr Tyr Asp Gin Ala Asp Gly His Asn His Ser His 
155 160 165 

acc att tat gaa tac aga aga aaa taa 
Thr He Tyr Glu Tyr Arg Arg Lys 
170 175 



102 



150 



198 



246 



294 



342 



390 



438 



gat ggt gac acc acc atg cca gac att gac tgg gac age ttt aat cag 486 
Asp Gly Asp Thr Thr Met Pro Asp lie Asp Trp Asp Ser Phe Asn Gin 
140 145 150 



534 



561 



<210> 56 
<211> 177 
<212> PRT 
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<213> Alloiococcus otitidis 

<400> 56 _ 

Met Gly He Phe Lys Pro He Cys He Gly Glu He Thr Met He Ala 

1 5 10 15 

Tyr Val Trp Ala Gin Asp Glu Gin Gly He He Gly Lys Asp Lys Val 

20 25 30 

Leu Pro Trp Glu Leu Ser Asn Asp Leu Lys His Phe Lys Lys Val Thr 
35 40 45 

Glu Gly His Thr He Leu Met Gly Arg Lys Thr Phe Glu Gly Met Asp 
50 55 60 

Lys Lys Pro Leu Pro Asn Arg Lys Thr Leu Val Leu Thr Arg Gin Asp 
65 70 75 80 

Asp Tyr Gin Ala Gly Asp Asp Gin Val Glu Val Val His Ser Lys Asp 

85 90 95 

Gin Ala Leu Thr Tyr Ala Ser Gly His Gly Val Asp Leu Tyr Val He 

100 105 HO 

Gly Gly Ala Gly He Phe Asp Leu Phe Leu Asp Gin Val Asp Val Leu 
115 120 125 

His Gin Thr Val He His Glu Ser Phe Asp Gly Asp Thr Thr Met Pro 
130 135 140 

Asp He Asp Trp Asp Ser Phe Asn Gin Val Ser Lys Ala Tyr Tyr Asp 
145 150 155 160 

Gin Ala Asp Gly His Asn His Ser His Thr He Tyr Glu Tyr Arg Arg 

165 170 175 



Lys 



<210> 57 
<211> 1968 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 
<221> CDS 
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<222> (7) . . (1968) 
<223> 



48 



96 



144 



192 



240 



288 



<400> 57 

agagat ttg acg aag gaa tct tac aat gat tea tct ata acc ata etc 
Met Thr Lvs Glu Ser Tyr Asn Asp Ser Ser lie Thr lie Leu 
15 10 

aag ggc tta gac gee gtt aag aaa aga cca ggc atg tat ate ggg tea 
Lys Gly Leu Asp Ala Val Lys Lys Arg Pro Gly Met Tyr lie Gly Ser 
15 - 20 25 30 

acc gat gee agg ggt ttg cac cac ctg gtt tat gaa att acc gat aat 
Thr Asp Ala Arg Gly Leu His His Leu Val Tyr Glu He Thr Asp Asn 

35 40 45 

get att gat gag gtt ttg get ggc tac get gat gaa att gaa gtc aag 
Ala He Asp Glu Val Leu Ala Gly Tyr Ala Asp Glu He Glu Val Lys 

50 55 60 

ate cac acg gac ggc teg gtt teg gtc aaa gac aat gga egg ggc atg 
He His Thr Asp Gly Ser Val Ser Val Lys Asp Asn Gly Arg Gly Met 
65 70 75 

cca acc ggg atg cat gag tea ggc eta ccc acc ate cag gtt ate ttt 
Pro Thr Gly Met His Glu Ser Gly Leu Pro Thr He Gin Val He Phe 
80 85 90 

acc gtc etc cat gec ggg gga aaa ttt ggc caa gag ggg gee tac aag 336 
Thr Val Leu His Ala Gly Gly Lys Phe Gly Gin Glu Gly Ala Tyr Lys 
95 100 105 HO * 

tea gec ggt gga etc cat ggg gtt ggg gee teg gtc gtc aac gee ttg 
Ser Ala Gly Gly Leu His Gly Val Gly Ala Ser Val Val Asn Ala Leu 

115 120 125 

tct gat tgg etc acg gtg ata gtg acc aag gac ggc tat gaa tac egg 
Ser Asp Trp Leu Thr Val He Val Thr Lys Asp Gly Tyr Glu Tyr Arg 

130 135 140 

caa gac ttt age caa gga ggc cag get aaa gga ggc ate cag aag aga 
Gin Asp Phe Ser Gin Gly Gly Gin Ala Lys Gly Gly He Gin Lys Arg 
145 150 155 

aaa att aac cag caa aaa tec age acc ctg gtc cac ttc aaa ccc tea 
Lys He Asn Gin Gin Lys Ser Ser Thr Leu Val His Phe Lys Pro Ser 
160 165 170 

ggc caa gtc ttt teg acc acc gaa ttt aac ttt aac acc ate tgt gag 
Gly Gin Val Phe Ser Thr Thr Glu Phe Asn Phe Asn Thr He Cys Glu 
175 180 - 185 190 

egg atg egg gag teg gee ttc ctt gtc aaa ggg acc aag att acc gta 
Arg Met Arg Glu Ser Ala Phe Leu Val Lys Gly Thr Lys He Thr Val 

195 200 205 



384 



432 



480 



528 



576 



624 



gag gac ctg cgc cag gaa gaa age cag gtc ttc caa ttt aat gaa gga 



672 
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Glu Asp Leu Arg Gin Glu Glu Ser Gin Val Phe Gin Phe Asn Glu Gly 

210 215 220 

att aag gcc ttt gtc gac tac tta aat gag ggc aag gat acc ttg agt 
lie Lys Ala Phe Val Asp Tyr Leu Asn Glu Gly Lys Asp Thr Leu Ser 
225 230 235 

cca gta acc tat ttt gaa ggt tct gaa gat gaa att gaa gtt gaa ttt 
Pro Val Thr Tyr Phe Glu Gly Ser Glu Asp Glu lie Glu Val Glu Phe 
240 245 250 

gcc ttc caa tac aat gac ggc tat teg gag acg gtt ctg agt ttt gtc 
Ala Phe Gin Tyr Asn Asp Gly Tyr Ser Glu Thr Val Leu Ser Phe Val 
255 260 265 270 

aac aat gtc cgt acc egg gat ggg ggc age cac gaa act gga get aag 
Asn Asn Val Arg Thr Arg Asp Gly Gly Ser His Glu Thr Gly Ala Lys 

275 280 285 

tea get att acc aag get ttc aac gac tat get agg aaa agt ggc tta 
Ser Ala He Thr Lys Ala Phe Asn Asp Tyr Ala Arg Lys Ser Gly Leu 

290 295 300 

etc aaa gag aaa gac agt aac ttg gaa gga tct gac gtc egg gaa ggg 
Leu Lys Glu Lys Asp Ser Asn Leu Glu Gly Ser Asp Val Arg Glu Gly 
305 " 310 315 

att gcg gtt gtt tta tec gtc cgt ate cca gaa gag att etc caa ttt 
He Ala Val Val Leu Ser .Val Arg He Pro Glu Glu He Leu Gin Phe 
320 325 330 

gaa ggc cag acc aag age aag tta gga act cct caa gcc egg acc gcc 
Glu Gly Gin Thr Lys Ser Lys Leu Gly Thr Pro Gin Ala Arg Thr Ala 
335 340 345 350 

act gac cag gtt ate tea gaa tec tta act tac ttc ctg gcc gaa aat 
Thr Asp Gin Val He Ser Glu Ser Leu Thr Tyr Phe Leu Ala Glu Asn 

355 360 365 

ggg gac ttg tct aag caa ctt att cgc aag gcc ate cga gcc egg tct 
Gly Asp Leu Ser Lys Gin Leu He Arg Lys Ala He Arg Ala Arg Ser 

370 375 380 

gcc agg gaa gca get cgc aag gcc aag gac cag tec egg aac tct get 
Ala Arg Glu Ala Ala Arg Lys Ala Lys Asp Gin Ser Arg Asn Ser Ala 
385 390 395 

tec aag aaa aaa gtt gaa act etc ctg tct ggt aag ttg acc cca get 
Ser Lys Lys Lys Val Glu Thr Leu Leu Ser Gly Lys Leu Thr Pro Ala 
400 405 410 

caa age aag aac gcc cag aaa aat gaa ctt tac tta gtg gag ggg gat 
Gin Ser Lys Asn Ala Gin Lys Asn Glu Leu Tyr Leu Val Glu Gly Asp 
415 420 425 430 

teg get ggt ggg tea gcc aag caa ggt agg gac egg aaa ttc caa gca 
Ser Ala Gly Gly Ser Ala Lys Gin Gly Arg Asp Arg Lys Phe Gin Ala 
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435 440 445 

att ttg ccc ctg cgt gga aag gtt ate aac aca gaa aaa tct tct ttg 
lie Leu Pro Leu Arg Gly Lys Val lie Asn Thr Glu Lys Ser Ser Leu 

450 455 460 

gat gat att tta aaa aat gaa gaa att tct acc atg att tat acc ate 
Asp Asp He Leu Lys Asn Glu Glu He Ser Thr Met He Tyr Thr He 
465 470 475 

ggt gca ggt get ggg cct gag ttt gat att gaa get gtt aat tac gat 
Gly Ala Gly Ala Gly Pro Glu Phe Asp He Glu Ala Val Asn Tyr Asp 
480 485 490 

aag ata gtc att atg act gat gec gac aca gac ggc gec cac ate cag 
Lys He Val lie Met Thr Asp Ala Asp Thr Asp Gly Ala His He Gin 
495 500 505 510 

gtc ctt etc etc acc ttc ttt tac egg tac atg aaa ccc ctg att gaa 
Val Leu Leu Leu Thr Phe Phe Tyr Arg Tyr Met Lys Pro Leu He Glu 

515 520 525 

gca ggg aag gtc tat att gec eta ccg ccc ttg tat aag ttg acc aaa 
Ala Gly Lys Val Tyr He Ala Leu Pro Pro Leu Tyr Lys Leu Thr Lys 

530 535 540 

aag caa gga aag caa gaa aaa aca gec tat get tgg act gat gag gag 
Lys Gin Gly Lys Gin Glu Lys Thr Ala Tyr Ala Trp Thr Asp Glu Glu 
545 550 555 

ttg gaa gac ctg gtt aaa gat ttt ggc aaa cac tac act etc cag cgc 
Leu Glu Asp Leu Val Lys Asp Phe Gly Lys His Tyr Thr Leu Gin Arg 
560 565 570 



atg gac cca gag acc aga acc ttg ate egg gtc acc att gaa gac agt 
Met Asp Pro Glu Thr Arg Thr Leu He Arg Val Thr He Glu Asp Ser 

595 600 605 



cct aga egg aag tgg att gaa gac cat att gaa ttc agt ctg gca gaa 
Pro Arg Arg Lys Trp He Glu Asp His He Glu Phe Ser Leu Ala Glu 
625 " 630 635 

gat ggc agt att tta gag aac aag gtc eta gaa gga gag gec aag taa 
Asp Gly Ser He Leu Glu Asn Lys Val Leu Glu Gly Glu Ala Lys 
640 645 650 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



tac aag ggt tta ggc gag atg aat get gac cag ttg tgg gag acc acc 1776 
Tyr Lys Gly Leu Gly Glu Met Asn Ala Asp Gin Leu Trp Glu Thr Thr 
575 " 580 585 590 



1824 



gaa aag get gaa aga egg gtt tec acc ttg atg ggg acc aag gtg gat 1872 
Glu Lys Ala Glu Arg Arg Val Ser Thr Leu Met Gly Thr Lys Val Asp 

610 615 620 



1920 



1968 



<210> 58 
<211> 653 
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<212> PRT 

<213> Alloiococcus otitidis 

<400> 58 /~o 
Met Thr Lys Glu Ser Tyr Asn Asp Ser Ser He Thr He Leu Lys Gly 

1 5 10 15 

Leu Asp Ala Val Lys Lys Arg Pro Gly Met Tyr He Gly Ser Thr Asp 

20 25 30 

Ala Arg Gly Leu His His Leu Val Tyr Glu He Thr Asp Asn Ala He 
35 40 * 45 

Asp Glu Val Leu Ala Gly Tyr Ala Asp Glu He Glu Val Lys He His 
50 55 60 

Thr Asp Gly Ser Val Ser Val Lys Asp Asn Gly Arg Gly Met Pro Thr 
65 70 75 80 

Glv Met His Glu Ser Gly Leu Pro Thr He Gin Val He Phe Thr Val 

85 90 95 

Leu His Ala Gly Gly Lys Phe Gly Gin Glu Gly Ala Tyr Lys Ser Ala 

100 105 HO 

Gly Gly Leu His Gly Val Gly Ala Ser Val Val Asn Ala Leu Ser Asp 
115 120 125 

Trp Leu Thr Val He Val Thr Lys Asp Gly Tyr Glu Tyr Arg Gin Asp 
130 135 140 

Phe Ser Gin Gly Gly Gin Ala Lys Gly Gly He Gin Lys Arg Lys He 
145 150 155 160 

Asn Gin Gin Lys Ser Ser Thr Leu Val His Phe Lys Pro Ser Gly Gin 

165 170 175 

Val Phe Ser Thr Thr Glu Phe Asn Phe Asn Thr He Cys Glu Arg Met 

180 185 190 

Arg Glu Ser Ala Phe Leu Val Lys Gly Thr Lys He Thr Val Glu Asp 
195 200 205 

Leu Arg Gin Glu Glu Ser Gin Val Phe Gin Phe Asn Glu Gly He Lys 
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210 



215 220 



Ala Phe Val Asp Tyr Leu Asn Glu Gly Lys Asp Thr Leu Ser Pro Val 
225 230 235 240 

Thr Tyr Phe Glu Gly Ser Glu Asp Glu lie Glu Val Glu Phe Ala Phe 

245 250 255 

Gin Tyr Asn Asp Gly Tyr Ser Glu Thr Val Leu Ser Phe Val Asn Asn 

260 265 270 

Val Arg Thr Arg Asp Gly Gly Ser His Glu Thr Gly Ala Lys Ser Ala 
275 280 285 

lie Thr Lys Ala Phe Asn Asp Tyr Ala Arg Lys Ser Gly Leu Leu Lys 
290 295 300 

Glu Lys Asp Ser Asn Leu Glu Gly Ser Asp Val Arg Glu Gly He Ala 
305 310 315 

Val Val Leu Ser Val Arg He Pro Glu Glu He Leu Gin Phe Glu Gly 

325 330 335 

Gin Thr Lys Ser Lys Leu Gly Thr Pro Gin Ala Arg Thr Ala Thr Asp 

340 345 350 

Gin Val He Ser Glu Ser Leu Thr Tyr Phe Leu Ala Glu Asn Gly Asp 
355 360 365 

Leu Ser Lys Gin Leu He Arg Lys Ala He Arg Ala Arg Ser Ala Arg 
370 375 380 

Glu Ala Ala Arg Lys Ala Lys Asp Gin Ser Arg Asn Ser Ala Ser Lys 
385 390 395 400 

Lys Lys Val Glu Thr Leu Leu Ser Gly Lys Leu Thr Pro Ala Gin Ser 

405 410 415 

Lys Asn Ala Gin Lys Asn Glu Leu Tyr Leu Val Glu Gly Asp Ser Ala 

420 425 430 

Gly Gly Ser Ala Lys Gin Gly Arg Asp Arg Lys Phe Gin Ala He Leu 
435 440 445 
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Pro Leu Arg Gly Lys Val lie Asn Thr Glu Lys Ser Ser Leu Asp Asp 
450 455 460 

He Leu Lys Asn Glu Glu He Ser Thr Met He Tyr Thr He Gly Ala 
465 470 475 480 

Gly Ala Gly Pro Glu Phe Asp He Glu Ala Val Asn Tyr Asp Lys He 

485 490 495 

Val He Met Thr Asp Ala Asp Thr Asp Gly Ala His He Gin Val Leu 

500 505 510 

Leu Leu Thr Phe Phe Tyr Arg Tyr Met Lys Pro Leu He Glu Ala Gly 
515 520 525 

Lys Val Tyr He Ala Leu Pro Pro Leu Tyr Lys Leu Thr Lys Lys Gin 
530 535 540 

Glv Lys Gin Glu Lys Thr Ala Tyr Ala Trp Thr Asp Glu Glu Leu Glu 
545 550 555 560 

Asp Leu Val Lys Asp Phe Gly Lys His Tyr Thr Leu Gin Arg Tyr Lys 

565 570 575 

Gly Leu Gly Glu Met Asn Ala Asp Gin Leu Trp Glu Thr Thr Met Asp 

580 585 590 

Pro Glu Thr Arg Thr Leu He Arg Val Thr He Glu Asp Ser Glu Lys 
595 600 605 

Ala Glu Arg Arg Val Ser Thr Leu Met Gly Thr Lys Val Asp Pro Arg 
610 615 620 

Arg Lys Trp He Glu Asp His He Glu Phe Ser Leu Ala Glu Asp Gly 
625 630 635 640 

Ser He Leu Glu Asn Lys Val Leu Glu Gly Glu Ala Lys 

645 650 



<210> 59 
<211> 2463 
<212> DNA 
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<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (4) . . (2463) 

<223> 



<400> 59 

att atg gca gga gac caa gag acc agt aaa ata caa gaa tta acc tta 
Met Ala Gly Asp Gin Glu Thx Ser Lys He Gin Glu Leu Thr Leu 
1 1 5 10 15 

gaa gat gtc atg ggg gac egg ttc ggc egg tat tec aag tac att ata 
Glu Asp Val Met Gly Asp Arg Phe Gly Arg Tyr Ser Lys Tyr He He 

20 25 30 

cag gaa agg gec eta ccg gac ttg egg gac ggt tta aaa ccg gtc caa 
Gin Glu Arg Ala Leu Pro Asp Leu Arg Asp Gly Leu Lys Pro Val Gin 

35 40 45 

aga egg ate etc tat gee atg cac cag gac aaa aac acc tat gac aag 
Arg Arg He Leu Tyr Ala Met His Gin Asp Lys Asn Thr Tyr Asp Lys 
50 55 60 

get tac egg aag teg gee aag acg gtg gga aat gtc ata ggg aac tac 
Ala Tyr Arg Lys Ser Ala Lys Thr Val Gly Asn Val He Gly Asn Tyr 
65 70 75 

cac ccc cat ggc gac aca tec gtt tac gat gec atg gtt agg etc agt 
His Pro His Gly Asp Thr Ser Val Tyr Asp Ala Met Val Arg Leu Ser 
80 85 90 95 



ggg age atg gac ggg gac cca cca get gec atg egg tac acc gaa gee 
Gly Ser Met Asp Gly Asp Pro Pro Ala Ala Met Arg Tyr Thr Glu Ala 

115 120 125 

cgt ctg tct aaa att get tec gac etc ctg get gat att gat aag gag 
Arg Leu Ser Lys He Ala Ser Asp Leu Leu Ala Asp He Asp Lys Glu 
13 0 135 140 

acg gtg gac cat gtc tta aac ttt gat gac acg acc gag gag ccc acc 
Thr Val Asp His Val Leu Asn Phe Asp Asp Thr Thr Glu Glu Pro Thr 
145 150 155 

gtc tta ccc gec cgt ttt ccc aac etc ttg gtc aat ggg get age ggg 
Val Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly 
160 165 170 175 

att tea gee ggt tat get act gac ata ccg ccc cat aat ttg age gag 
He Ser Ala Gly Tyr Ala Thr Asp He Pro Pro His Asn Leu Ser Glu 

180 185 190 



48 



96 



144 



192 



240 



288 



cag cct tgg aag atg cgc cat cct ttg gtt gat atg cac ggg aac aag 336 
Gin Pro Trp Lys Met Arg His Pro Leu Val Asp Met His Gly Asn Lys 

100 105 HO 



3 84 



432 



480 



528 



576 



gtg att gat gec acc ate cac tta ate aac cac ccc aat gca agg ctg 



624 
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Val He Asp Ala Thr He His Leu He Asn His Pro Asn Ala Arg Leu 

195 200 205 

gag act ttg atg gac tat att caa gga cca gac ttt ccg act ggg ggg 
Glu Thr Leu Met Asp Tyr He Gin Gly Pro Asp Phe Pro Thr Gly Gly 
210 * 215 220 

att ate caa ggt aaa agt ggc ctg aag aaa gec tac caa acg ggc aag 
He He Gin Gly Lys Ser Gly Leu Lys Lys Ala Tyr Gin Thr Gly Lys 
225 230 235 

gga aaa att ate ate egg gec aaa gca gat att gag gee ate egg ggt 
Gly Lys He He He Arg Ala Lys Ala Asp He Glu Ala He Arg Gly 
240 ~ 245 250 255 

ggc aaa tec caa att gtc ate agt caa att cct tat gag gtc aac aag 
Gly Lys Ser Gin He Val He Ser Gin He Pro Tyr Glu Val Asn Lys 

260 265 270 

gca agg ttg gtc caa aaa att gac gac ate egg att aac aaa aaa ate 
Ala Arg Leu Val Gin Lys He Asp Asp He Arg He Asn Lys Lys He 

275 280 285 



att gtg gtc gaa ace aaa aaa gat ggt gat ggg gaa ggg ate tta acc 
He Val Val Glu Thr Lys Lys Asp Gly Asp Gly Glu Gly He Leu Thr 
305 310 315 

tac ctg ctg aaa aac acc gac etc cag gta act tat aac tta aat atg 
Tyr Leu Leu Lys Asn Thr Asp Leu Gin Val Thr Tyr Asn Leu Asn Met 
320 325 330 335 

gta gee att gat aaa aaa cga ccc cag caa gtc tec etc aag caa ate 
Val Ala He Asp Lys Lys Arg Pro Gin Gin Val Ser Leu Lys Gin He 

340 345 350 



672 



720 



768 



816 



864 



gac ggc att gee gat gtc egg gat gaa agt gac egg tct ggc ttg egg 912 
Asp Gly He Ala Asp Val Arg Asp Glu Ser Asp Arg Ser Gly Leu Arg 
290 295 3 00 



960 



1008 



1056 



tta tct tct tac ttg gac 
Leu Ser Ser Tyr Leu Asp 

355 

cgt tac etc tta gee aag 
Arg Tyr Leu Leu Ala Lys 
370 

ctt ate aag gee att tea 
Leu lie Lys Ala lie Ser 
385 

gec agt gaa aac aag gee 
Ala Ser Glu Asn Lys Ala 
400 405 

ggt ttt age caa gac caa 
Gly Phe Ser Gin Asp Gin 



cac aag egg aca gtg gtt 
His Lys Arg Thr Val Val 
360 

gee aag gac cgc cag cac 
Ala Lys Asp Arg Gin His 

. 375 

ate ctg gat gac ttg ate 
lie Leu Asp Asp Leu He 
390 395 

aat gee aag gaa aat att 
Asn Ala Lys Glu Asn He 

410 

gee gaa gec att gtc tec 
Ala Glu Ala He Val Ser 



caa aac egg acc 1104 
Gin Asn Arg Thr 
365 

att gtc caa ggc 1152 

He Val Gin Gly 

380 

caa acc ate egg 1200 
Gin Thr He Arg 



ate cag get tat 1248 
lie Gin Ala Tyr 

415 

etc cag ctt tac 1296 
Leu Gin Leu Tyr 
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420 425 430 

cgc ttg acc aat aca gat ata aag gac tta caa gca gaa gcc aaa gac 
Arg Leu Thr Asn Thr Asp lie Lys Asp Leu Gin Ala Glu Ala Lys Asp 

435 440 445 

tta gcc caa gcc ate ctg acc tac cag gac etc tta acc aac aag gcc 
Leu Ala Gin Ala lie Leu Thr Tyr Gin Asp Leu Leu Thr Asn Lys Ala 
450 455 460 

age ctg gat get ttg atg aaa gaa gaa ttg aaa gaa gtc aaa caa gca 
Ser Leu Asp Ala Leu Met Lys Glu Glu Leu Lys Glu Val Lys Gin Ala 
465 470 475 

tat ggg gag gac egg eta acc cag gtc caa gac aag ate gaa aaa eta 
Tyr Gly Glu Asp Arg Leu Thr Gin Val Gin Asp Lys lie Glu Lys Leu 
480 ~ *" 485 49 0 495 



ttt atg caa gag ttg tea acc eta gac caa etc ctt att ttc acc teg 

Phe Met Gin Glu Leu Ser Thr Leu Asp Gin Leu Leu lie Phe Thr Ser 
545 550 555 

aaa ggc aat gtg gtc aac cga cca gtc cat gaa tta ccg gac ate aag 

Lys Gly Asn Val Val Asn Arg Pro Val His Glu Leu Pro Asp He Lys 

560 565 570 575 



gac gag gaa ttg att aag gtg tac cct tat egg gaa tta gat gcc ggc 
Asp Glu Glu Leu lie Lys Val Tyr Pro Tyr Arg Glu Leu Asp Ala Gly 

595 600 605 

aag cgc tat gtc ttt ate act cga gat ggc tat ate aaa caa agt cca 
Lys Arg Tyr Val Phe He Thr Arg Asp Gly Tyr He Lys Gin Ser Pro 
610 615 620 

gag acg gaa ttt gag ccc aaa cga act tac aag tct egg get tea act 
Glu Thr Glu Phe Glu Pro Lys Arg Thr Tyr Lys Ser Arg Ala Ser Thr 
625 630 635 

gcc att aaa tta aaa tea gac caa gat aga etc cag gca gtc tac tat 
Ala He Lys Leu Lys Ser Asp Gin Asp Arg Leu Gin Ala Val Tyr Tyr 
640 645 650 655 



1344 



1392 



1440 



1488 



gaa ata gaa acc caa gtc ctg gtc agt gaa gaa gac gtc atg gtt acc 153 6 

Glu He Glu Thr Gin Val Leu Val Ser Glu Glu Asp Val Met Val Thr 

500 505 510 

gtc acc cag gga ggt tac ttg aag egg acc tec ate egg tct tac aag 1584 

Val Thr Gin Gly Gly Tyr Leu Lys Arg Thr Ser He Arg Ser Tyr Lys 

515 520 525 

get tec caa gtg gag gaa ttg ggc egg cga gaa gac gac ttg gtc ate 1632 
Ala Ser Gin Val Glu Glu Leu Gly Arg Arg Glu Asp Asp Leu Val He 
530 535 540 



1680 



1728 



tgg aag gat att gga gag cac ttg tea agg acc ate ccc ctt gga gag 177 6 

Trp Lys Asp He Gly Glu His Leu Ser Arg Thr He Pro Leu Gly Glu 

580 585 590 



1824 



1872 



1920 



1968 
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att cct gac caa gaa gat tac gat gta ttc eta gec age tac aag ggc 
lie Pro Asp Gin Glu Asp Tyr Asp Val Phe Leu Ala Ser Tyr Lys Gly 

660 665 670 

tac ggg etc aag tat gga eta gaa gaa gtg tea gaa gta ggg gee cag 
Tyr Gly Leu Lys Tyr Gly Leu Glu Glu Val Ser Glu Val Gly Ala Gin 

675 " 680 685 



2016 



gat ggt ttg gtc ttt aag cgt aag cag ttc caa gaa gee ttg ttc att 
Asp Gly Leu Val Phe Lys Arg Lys Gin Phe Gin Glu Ala Leu Phe He 
705 710 715 

ace cag cga gec agt gtt aag aaa atg gee etc cat gac ttt gac egg 
Thr Gin Arg Ala Ser Val Lys Lys Met Ala Leu His Asp Phe Asp Arg 
720 725 730 735 

act tea egg gee aag egg ggt tta caa ate etc aga gaa ctg aag cga 
Thr Ser Arg Ala Lys Arg Gly Leu Gin He Leu Arg Glu Leu Lys Arg 

740 745 750 

aac ccc cac cga ate cag ttt atg ate gga att tea caa aat aaa ttc 
Asn Pro His Arg He Gin Phe Met He Gly He Ser Gin Asn Lys Phe 

755 760 765 

ctg gtc aat etc eta act gat aca aaa aaa eta gta cag ata aac cca 
Leu Val Asn Leu Leu Thr Asp Thr Lys Lys Leu Val Gin He Asn Pro 
770 775 780 

gat gac tat aca gtt tea aac cgc cat aac aat ggg tct ttt gtc ctg 
Asp Asp Tyr Thr Val Ser Asn Arg His Asn Asn Gly Ser Phe Val Leu 
785 ~ 790 795 

gac aca age cga gat ggc aag cct gtt tct tac tat tta agt gat aac 
Asp Thr Ser Arg Asp Gly Lys Pro Val Ser Tyr Tyr Leu Ser Asp Asn 
800 805 810 815 

gat tct cac ttg taa 
Asp Ser His Leu 



2064 



get gca ggc gtc aag tec atg aac ctg aaa gag ggg gac cat gtc caa 2112 
Ala Ala Gly Val Lys Ser Met Asn Leu Lys Glu Gly Asp His Val Gin 
690 " 695 700 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2463 



<210> 60 
<211> 819 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 60 

Met Ala Gly Asp Gin Glu Thr Ser Lys He Gin Glu Leu Thr Leu 
15 10 15 



Asp Val Met Gly Asp Arg Phe Gly Arg Tyr Ser Lys Tyr He He 

20 25 30 
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Glu Arg Ala Leu Pro Asp Leu Arg Asp Gly Leu Lys Pro Val Gin Arg 
35 40 45 



Arg He Leu Tyr Ala Met His Gin Asp Lys Asn Thr Tyr Asp Lys Ala 
50 55 60 



Tyr Arg Lys Ser Ala Lys Thr Val Gly Asn Val He Gly Asn Tyr His 
65 * 70 75 80 



Pro His Gly Asp Thr Ser Val Tyr Asp Ala Met Val Arg Leu Ser Gin 

85 90 95 



Pro Trp Lys Met Arg His Pro Leu Val Asp Met His Gly Asn Lys Gly 

100 105 HO 



Ser Met Asp Gly Asp Pro Pro Ala Ala Met Arg Tyr Thr Glu Ala Arg 
115 120 125 



Leu Ser Lys He Ala Ser Asp Leu Leu Ala Asp He Asp Lys Glu Thr 
13 0 135 140 



Val Asp His Val Leu Asn Phe Asp Asp Thr Thr Glu Glu Pro Thr Val 
145 150 155 160 



Leu Pro Ala Arg Phe Pro Asn Leu Leu Val Asn Gly Ala Ser Gly He 

165 170 175 



Ser Ala Gly Tyr Ala Thr Asp He Pro Pro His Asn Leu Ser Glu Val 

180 185 190 



He Asp Ala Thr He His Leu He Asn His Pro Asn Ala Arg Leu Glu 
195 200 205 



Thr Leu Met Asp Tyr He Gin Gly Pro Asp Phe Pro Thr Gly Gly He 
210 215 220 



He Gin Gly Lys Ser Gly Leu Lys Lys Ala Tyr Gin Thr Gly Lys Gly 
225 230 235 240 



Lys He He He Arg Ala Lys Ala Asp He Glu Ala He Arg Gly Gly 

245 250 255 
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Lys Ser Gin lie Val lie Ser Gin He Pro Tyr Glu Val Asn Lys Ala 

260 265 270 

Arg Leu Val Gin Lys He Asp Asp He Arg He Asn Lys Lys He Asp 
275 280 285 

Gly He Ala Asp Val Arg Asp Glu Ser Asp Arg Ser Gly Leu Arg He 
290 295 300 

Val Val Glu Thr Lys Lys Asp Gly Asp Gly Glu Gly He Leu Thr Tyr 
305 310 315 320 

Leu Leu Lys Asn Thr Asp Leu Gin Val Thr Tyr Asn Leu Asn Met Val 

325 330 335 

Ala He Asp Lys Lys Arg Pro Gin Gin Val Ser Leu Lys Gin He Leu 

340 345 350 

Ser Ser Tyr Leu Asp His Lys Arg Thr Val Val Gin Asn Arg Thr Arg 
355 360 365 

Tyr Leu Leu Ala Lys Ala Lys Asp Arg Gin His He Val Gin Gly Leu 
370 375 380 

He Lys Ala He Ser He Leu Asp Asp Leu He Gin Thr He Arg Ala 
385 "* 390 395 400 

Ser Glu Asn Lys Ala Asn Ala Lys Glu Asn He He Gin Ala Tyr Gly 

405 410 415 

Phe Ser Gin Asp Gin Ala Glu Ala He Val Ser Leu Gin Leu Tyr Arg 

420 425 430 

Leu Thr Asn Thr Asp He Lys Asp Leu Gin Ala Glu Ala Lys Asp Leu 
435 440 445 

Ala Gin Ala He Leu Thr Tyr Gin Asp Leu Leu Thr Asn Lys Ala Ser 
450 455 460 

Leu Asp Ala Leu Met Lys Glu Glu Leu Lys Glu Val Lys Gin Ala Tyr 
465 470 475 480 
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Gly Glu Asp Arg Leu Thr Gin Val Gin Asp Lys He Glu Lys Leu Glu 

485 490 495 

He Glu Thr Gin Val Leu Val Ser Glu Glu Asp Val Met Val Thr Val 

500 505 510 

Thr Gin Gly Gly Tyr Leu Lys Arg Thr Ser He Arg Ser Tyr Lys Ala 
515 520 525 

Ser Gin Val Glu Glu Leu Gly Arg Arg Glu Asp Asp Leu Val He Phe 
530 535 540 

Met Gin Glu Leu Ser Thr Leu Asp Gin Leu Leu He Phe Thr Ser Lys 
545 550 555 560 

Gly Asn Val Val Asn Arg Pro Val His Glu Leu Pro Asp He Lys Trp 

565 570 575 

Lys Asp He Gly Glu His Leu Ser Arg Thr He Pro Leu Gly Glu Asp 

580 585 590 

Glu Glu Leu He Lys Val Tyr Pro Tyr Arg Glu Leu Asp Ala Gly Lys 
595 600 605 

Arg Tyr Val Phe He Thr Arg. Asp Gly Tyr He Lys Gin Ser Pro Glu 
610 615 620 

Thr Glu Phe Glu Pro Lys Arg Thr Tyr Lys Ser Arg Ala Ser Thr Ala 
625 630 635 640 

He Lys Leu Lys Ser Asp Gin Asp Arg Leu Gin Ala Val Tyr Tyr He 

645 650 655 

Pro Asp Gin Glu Asp Tyr Asp Val Phe Leu Ala Ser Tyr Lys Gly Tyr 

660 665 670 

Gly Leu Lys Tyr Gly Leu Glu Glu Val Ser Glu Val Gly Ala Gin Ala 
6 75 680 685 

Ala Gly Val Lys Ser Met Asn Leu Lys Glu Gly Asp His Val Gin Asp 
690 695 700 

Gly Leu Val Phe Lys Arg Lys Gin Phe Gin Glu Ala Leu Phe He Thr 
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705 710 715 720 



Gin Arg Ala Ser Val Lys Lys Met Ala Leu His Asp Phe Asp Arg Thr 

725 730 735 



Ser Arg Ala Lys Arg Gly Leu Gin He Leu Arg Glu Leu Lys Arg Asn 

740 745 750 



Pro His Arg He Gin Phe Met He Gly He Ser Gin Asn Lys Phe Leu 
755 760 765 



Val Asn Leu Leu Thr Asp Thr Lys Lys Leu Val Gin He Asn Pro Asp 
770 775 780 



Asp Tyr Thr Val Ser Asn Arg His Asn Asn Gly Ser Phe Val Leu Asp 
785 790 795 800 



Thr Ser Arg Asp Gly Lys Pro Val Ser Tyr Tyr Leu Ser Asp Asn Asp 

805 810 815 



Ser His Leu 



<210> 61 
<211> 1113 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) -.(1113) 

<223> 

<400> 61 

tta gtg gtt gag aca aaa tea aaa eta gaa aat gca gta aac ace etc 

Met Val Glu Thr Lys Ser Lys Leu Glu Asn Ala Val Asn Thr Leu 
15 10 15 



48 



att aaa gac ttg aaa aat aaa aaa gag teg acc att tct tat att gac 96 
He Lys Asp Leu Lys Asn Lys Lys Glu Ser Thr He Ser Tyr He Asp 

20 25 30 

etc age aac aaa att get gaa ecc ttc gaa ctt gaa agt gaa gec atg 144 
Leu Ser Asn Lys He Ala Glu Pro Phe Glu Leu Glu Ser Glu Ala Met 

35 40 45 

gac aag tta ate cag caa tta gaa gat gat ggg att ggt gta gtt gac 192 
Asp Lys Leu He Gin Gin Leu Glu Asp Asp Gly He Gly Val Val Asp 
50 55 60 
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caa gac ggt aat ccc ttg gcc aag caa eta gec aag cag gaa gaa gaa 
Gin Asp Gly Asn Pro Leu Ala Lys Gin Leu Ala Lys Gin Glu Glu Glu 
65 70 75 

gca gaa aaa gcc aag gat gaa gaa atg ata gcc cca cct ggg gtt aaa 
Ala Glu Lys Ala Lys Asp Glu Glu Met He Ala Pro Pro Gly Val Lys 
80 85 90 95 

att aac gac cct gtc egg atg tac eta aaa gaa att ggc egg gta gat 
He Asn Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Arg Val Asp 

100 105 110 

ctt tta gat get gaa gaa gaa gtg gcc eta gcc aag egg att gaa gaa 
Leu Leu Asp Ala Glu Glu Glu Val Ala Leu Ala Lys Arg He Glu Glu 

115 120 125 

ggc gat gaa ate get aaa caa gaa eta get gag get aac ttg aga ctg 
Gly Asp Glu He Ala Lys Gin Glu Leu Ala Glu Ala Asn Leu Arg Leu 
130 135 140 

gtt gtc tct att get aaa egg tac gtt ggc egg ggc atg age ttt ttg 
Val Val Ser He Ala Lys Arg Tyr Val Gly Arg Gly Met Ser Phe Leu 
145 150 155 

gac ttg ate cag gaa ggg aat atg ggg eta atg aag gca gtt gaa aaa 
Asp Leu He Gin Glu Gly Asn Met Gly Leu Met Lys Ala Val Glu Lys 
160 165 170 175 

ttt gac tac gaa aaa ggt ttc aaa ttt tea ace tat gcc ace tgg tgg 
Phe Asp Tyr Glu Lys Gly Phe Lys Phe Ser Thr Tyr Ala Thr Trp Trp 

180 185 190 

ate cgt caa gcc ate act egg gcc att gcc gac caa gcc cga acc ate 
He Arg Gin Ala He Thr Arg Ala He Ala Asp Gin Ala Arg Thr lie 

195 200 205 

egg att ccg gtc cac atg gtc gaa act att aac aag ctg gtc cga ate 
Arg He Pro Val His Met Val Glu Thr He Asn Lys Leu Val Arg He 
210 215 220 



240 



att ggg gca gag atg gat ttg cca acc gaa aaa gtc aga gat att ttg 
He Gly Ala Glu Met Asp Leu Pro Thr Glu Lys Val Arg Asp He Leu 
240 " 245 250 255 



288 



336 



384 



432 



480 



528 



576 



624 



672 



cag egg cag etc eta caa gaa eta ggc egg gaa cca acc cca gaa gaa 720 
Gin Arg Gin Leu Leu Gin Glu Leu Gly Arg Glu Pro Thr Pro Glu Glu 
225 230 235 



768 



aaa att tec caa gaa ccc gtc tec ctt gaa acc cca att ggg gaa gaa 816 

Lys He Ser Gin Glu Pro Val Ser Leu Glu Thr Pro He Gly Glu Glu 

260 265 270 

gaa gat tec cac ctg gga gac ttt att gaa gat gat ggg gcc ttg teg 864 

Glu Asp Ser His Leu Gly Asp Phe He Glu Asp Asp Gly Ala Leu Ser 

275 280 285 



cca tct gat aat gca get tat gag ctg ttg aaa ggg gaa etc aaa gga 



912 
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Pro Ser Asp Asn Ala Ala Tyr Glu Leu Leu Lys Gly Glu Leu Lys Gly 

290 295 300 

gtc tta gac acc eta act gac egg gaa gaa aat gtc ttg cgc etc cgt 

Val Leu Asp Thr Leu Thr Asp Arg Glu Glu Asn Val Leu Arg Leu Arg 

305 310 315 

ttt ggc eta gat gat ggc cgt caa cgt act tta gaa gat gtc ggt aag 

Phe Gly Leu Asp Asp Gly Arg Gin Arg Thr Leu Glu Asp Val Gly Lys 

320 325 330 335 

gtc ttt ggg gtc acc egg gag egg ate cgt caa att gaa gcg aag gee 

Val Phe Gly Val Thr Arg Glu Arg lie Arg Gin lie Glu Ala Lys Ala 

340 345 350 

etc cgc aaa etc cgc cac cct age egg tec aaa caa tta aaa gac ttt 
Leu Arg Lys Leu Arg His Pro Ser Arg Ser Lys Gin Leu Lys Asp Phe 

355 360 365 



960 



1008 



1056 



1104 



tta gaa tag 1113 
Leu Glu 



<210> 62 
<211> 369 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 62 

Met Val Glu Thr Lys Ser Lys Leu Glu Asn Ala Val Asn Thr Leu lie 
15 10 15 



Lys Asp Leu Lys Asn Lys Lys Glu Ser Thr He Ser Tyr He Asp Leu 

20 25 30 



Ser Asn Lys He Ala Glu Pro Phe Glu Leu Glu Ser Glu Ala Met Asp 
35 40 45 



Lys Leu He Gin Gin Leu Glu Asp Asp Gly He Gly Val Val Asp Gin 
50 55 60 



Asp Gly Asn Pro Leu Ala Lys Gin Leu Ala Lys Gin Glu Glu Glu Ala 
65 70 75 80 



Glu Lys Ala Lys Asp Glu Glu Met He Ala Pro Pro Gly Val Lys He 

85 90 95 



Asn Asp Pro Val Arg Met Tyr Leu Lys Glu He Gly Arg Val Asp Leu 

100 105 HO 
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Leu Asp Ala Glu Glu Glu Val Ala Leu Ala Lys Arg He Glu Glu Gly 
115 120 125 

Asp Glu He Ala Lys Gin Glu Leu Ala Glu Ala Asn Leu Arg Leu Val 
130 135 140 

Val Ser He Ala Lys Arg Tyr Val Gly Arg Gly Met Ser Phe Leu Asp 
145 150 155 160 

Leu He Gin Glu Gly Asn Met Gly Leu Met Lys Ala Val Glu Lys Phe 

165 170 175 

Asp Tyr Glu Lys Gly Phe Lys Phe Ser Thr Tyr Ala Thr Trp Trp He 

180 185 190 



Arg Gin Ala He Thr Arg Ala He Ala Asp Gin Ala Arg Thr He Arg 
195 200 205 

He Pro Val His Met Val Glu Thr He Asn Lys Leu Val Arg He Gin 
210 215 220 

Arg Gin Leu Leu Gin Glu Leu Gly Arg Glu Pro Thr Pro Glu Glu He 
225 230 235 240 

Gly Ala Glu Met Asp Leu Pro Thr Glu Lys Val Arg Asp He Leu Lys 

245 250 255 



He Ser Gin Glu Pro Val Ser Leu Glu Thr Pro He Gly Glu Glu Glu 

260 265 270 



Asp Ser His Leu Gly Asp Phe He Glu Asp Asp Gly Ala Leu Ser Pro 
275 280 285 

Ser Asp Asn Ala Ala Tyr Glu Leu Leu Lys Gly Glu Leu Lys Gly Val 
290 295 300 

Leu Asp Thr Leu Thr Asp Arg Glu Glu Asn Val Leu Arg Leu Arg Phe 
305 310 315 320 

Gly Leu Asp Asp Gly Arg Gin Arg Thr Leu Glu Asp Val Gly Lys Val 

325 330 335 



Phe Gly Val Thr Arg Glu Arg He Arg Gin He Glu Ala Lys Ala Leu 
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340 345 350 



Arg Lys Leu Arg His Pro Ser Arg Ser Lys Gin Leu Lys Asp Phe Leu 
355 360 365 



Glu 



<210> 63 
<211> 1854 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (1) . . (1854) 

<223> 

<400> 63 

atg gtt aga ata cct gaa gag acc att aat caa ata cga age cag gca 
Met Val Arg He Pro Glu Glu Thr He Asn Gin He Arg Ser Gin Ala 
! - 5 10 15 

gat att gtc gat gtc att ggc caa tac ttg gac tta aac aag tct ggg 
Asp He Val Asp Val He Gly Gin Tyr Leu Asp Leu Asn Lys Ser Gly 

20 25 30 

gec aat tac ttt gec cac tgc ccc ttc cat gaa gac age acg cct tct 
Ala Asn Tyr Phe Ala His Cys Pro Phe His Glu Asp Ser Thr Pro Ser 
35 40 45 

ttt teg gtc aac aga gac aag caa att tat aag tgc ttt tct tgc aaa 
Phe Ser Val Asn Arg Asp Lys Gin He Tyr Lys Cys Phe Ser Cys Lys 
50 55 60 

cga ggt ggc agt gtc ttt age ttt ata caa gag aag gag gga ctt tec 
Arg Gly Gly Ser Val Phe Ser Phe He Gin Glu Lys Glu Gly Leu Ser 
65 ~ 70 75 80 

ttc cca gaa teg gtt ctt aaa gtg gca gac tta get aat gtg gac ctt 
Phe Pro Glu Ser Val Leu Lys Val Ala Asp Leu Ala Asn Val Asp Leu 

85 90 95 



tct ccc tac cga gac etc tat acc ate cat gac cag gee aag gac tac 
Ser Pro Tyr Arg Asp Leu Tyr Thr He His Asp Gin Ala Lys Asp Tyr 
115 120 125 

tac cag tat ate etc tta aag gec cag gtg gga gaa gtt get tac gac 
Tyr Gin Tyr He Leu Leu Lys Ala Gin Val Gly Glu Val Ala Tyr Asp 
130 135 140 



48 



96 



144 



192 



240 



288 



gat ccg gee tta aaa gaa get gtc caa ggc caa cct gac aaa gee gat 33 6 

Asp Pro Ala Leu Lys Glu Ala Val Gin Gly Gin Pro Asp Lys Ala Asp 

100 105 HO 



384 



432 
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tat etc cag aat cgt ggg att tec aga gag gtg atg gaa gag ttc gaa 480 
Tyr Leu Gin Asn Arg Gly lie Ser Arg Glu Val Met Glu Glu Phe Glu 
145 150 155 160 

ctg ggt tat tct ccc age caa agg gag teg etc cac ctt tat ttg cag 528 
Leu Gly Tyr Ser Pro Ser Gin Arg Glu Ser Leu His Leu Tyr Leu Gin 

165 170 175 

tec caa gac cag gcg gac ttg aca gat gac tta ctg gaa gaa acc ggc 57 6 

Ser Gin Asp Gin Ala Asp Leu Thr Asp Asp Leu Leu Glu Glu Thr Gly 

180 185 190 

ctt ttt tec aaa aga gaa gtg gaa agt gat agt ttt aaa gac cgc ttt 624 
Leu Phe Ser Lys Arg Glu Val Glu Ser Asp Ser Phe Lys Asp Arg Phe 
195 200 205 

gee aag egg ate ate ttc ccc tta aag aac tta caa ggg cag acg gtg 672 
Ala Lys Arg lie lie Phe Pro Leu Lys Asn Leu Gin Gly Gin Thr Val 
210 215 220 

ggc ttt teg ggc egg tat ttc caa gat gag cct aac cag gac ttc cat 720 
Gly Phe Ser Gly Arg Tyr Phe Gin Asp Glu Pro Asn Gin Asp Phe His 
225 230 235 240 

cat gee aag tat tta aac agt cca gaa acc aaa ata ttc aat aaa egg 7 68 

His Ala Lys Tyr Leu Asn Ser Pro Glu Thr Lys lie Phe Asn Lys Arg 

245 250 255 

egg acc etc ttt aac tac cac cag gee aag gee tac att cgt egg gee 816 
Arg Thr Leu Phe Asn Tyr His Gin Ala Lys Ala Tyr He Arg Arg Ala 

260 265 270 

aag gaa gtt gtc tta ttc gaa ggt tac atg gat gtg att get get tgg 864 
Lys Glu Val Val Leu Phe Glu Gly Tyr Met Asp Val He Ala Ala Trp 
275 280 285 

caa gcg ggg gtc aaa aat ggc tta get tec atg ggg acc agt ata aca 912 
Gin Ala Gly Val Lys Asn Gly Leu Ala Ser Met Gly Thr Ser He Thr 
290 295 300 

get gac caa gtc cag acc atg caa agg att get gac acc tta gtc ttg 960 
Ala Asp Gin Val Gin Thr Met Gin Arg lie Ala Asp Thr Leu Val Leu 
305 310 315 320 

gee ttt gac ggg gat gaa get ggc ctt gaa tec age aaa aag ate ctg 1008 
Ala Phe Asp Gly Asp Glu Ala Gly Leu Glu Ser Ser Lys Lys He Leu 

325 330 335 

gat gac tta age ttg acc age aag ctt caa att gaa gtg gtc att ttc 1056 
Asp Asp Leu Ser Leu Thr Ser Lys Leu Gin He Glu Val Val He Phe 

340 345 350 

cct aaa aaa atg gac ccg gat gaa tat att aga gaa aat gga cca gaa 1104 
Pro Lys Lys Met Asp Pro Asp Glu Tyr He Arg Glu Asn Gly Pro Glu 
355 360 365 

gec ttt caa aat etc ate caa cat ggt agg atg act gtc tac caa ttc 1152 
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Ala Phe Gin Asn Leu lie Gin His Gly Arg Met Thr Val Tyr Gin Phe 
370 375 380 

tta aaa gaa tac ttt aaa aaa tec tac aat eta gat aac gac teg gac 

Leu Lys Glu Tyr Phe Lys Lys Ser Tyr Asn Leu Asp Asn Asp Ser Asp 

385 390 395 400 

egg ttg aaa ttt ate caa ace atg ace aat aaa att ggc aag eta get 

Arg Leu Lys Phe He Gin Thr Met Thr Asn Lys He Gly Lys "Leu Ala 

405 410 415 



aac ctg tct tat gat acg att ata age caa gtt caa agt gaa gee act 
Asn Leu Ser Tyr Asp Thr He He Ser Gin Val Gin Ser Glu Ala Thr 
435 440 445 

eta aac cag caa gag get ttg aaa aag gac egg cat aag gaa ttt tct 
Leu Asn Gin Gin Glu Ala Leu Lys Lys Asp Arg His Lys Glu Phe Ser 
450 455 460 

caa gca aga gtg gaa gtc aaa gee cca agt agt caa aag act aag att 
Gin Ala Arg Val Glu Val Lys Ala Pro Ser Ser Gin Lys Thr Lys He 
465 470 475 480 

gac egg gee cag gaa aaa ctt 'tta aac cga etc ttt tac tat ccc caa 
Asp Arg Ala Gin Glu Lys Leu Leu Asn Arg Leu Phe Tyr Tyr Pro Gin 

485 490 495 

gtt caa gag ate ate gat get tat aat ccg gac ttt gaa ttt aaa acg 
Val Gin Glu He lie Asp Ala Tyr Asn Pro Asp Phe Glu Phe Lys Thr 

500 505 510 

gaa gtc cac cag egg att tac etc ttg ttt tta gaa tac age cag gaa 
Glu Val His Gin Arg He Tyr Leu Leu Phe Leu Glu Tyr Ser Gin Glu 
515 520 525 

aat gat age att gat tct ttc ate gat ttt gtc aaa gac aag gag acg 
Asn Asp Ser He Asp Ser Phe He Asp Phe Val Lys Asp Lys Glu Thr 
530 535 540 

aaa gag gtc ata tct gat ata atg tgg aca tec att gag gtc gaa ccc 
Lys Glu Val He Ser Asp He Met Trp Thr Ser He Glu Val Glu Pro 
545 550 555 560 

tea gat gaa gaa ate eta gac tac ttg gac tac att gac caa ace tac 
Ser Asp Glu Glu He Leu Asp Tyr Leu Asp Tyr He Asp Gin Thr Tyr 

565 570 575 



aaa cag tec ggt aat aag aag cga gag ctg gaa tta ace aat caa tta 
Lys Gin Ser Gly Asn Lys Lys Arg Glu Leu Glu Leu Thr Asn Gin Leu 



1200 



1248 



tec ccc ttg gaa agg gaa gtc tat gee aag gat ttg gca gaa gaa ttt 1296 
Ser Pro Leu Glu Arg Glu Val Tyr Ala Lys Asp Leu Ala Glu Glu Phe 

420 425 430 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



ccc ctg gag caa aaa cgc caa gac tgc ttg gag gaa gtc aaa gca get 177 6 

Pro Leu Glu Gin Lys Arg Gin Asp Cys Leu Glu Glu Val Lys Ala Ala 

580 585 590 



1824 
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595 600 605 

att gaa ata aac cgt atg eta aaa caa taa 1854 
lie Glu lie Asn Arg Met Leu Lys Gin 
610 615 



<210> 64 
<211> 617 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 64 

Met Val Arg lie Pro Glu Glu Thr lie Asn Gin He Arg Ser Gin Ala 
15 10 15 



Asp He Val Asp Val He Gly Gin Tyr Leu Asp Leu Asn Lys Ser Gly 

20 25 30 



Ala Asn Tyr Phe Ala His Cys Pro Phe His Glu Asp Ser Thr Pro Ser 
35 40 45 



Phe Ser Val Asn Arg Asp Lys Gin He Tyr Lys Cys Phe Ser Cys Lys 
50 55 60 



Arg Gly Gly Ser Val Phe Ser Phe He Gin Glu Lys Glu Gly Leu Ser 
65 70 75 80 



Phe Pro Glu Ser Val Leu Lys Val Ala Asp Leu Ala Asn Val Asp Leu 

85 90 95 



Asp Pro Ala Leu Lys Glu Ala Val Gin Gly Gin Pro Asp Lys Ala Asp 

100 105 110 



Ser Pro Tyr Arg Asp Leu Tyr Thr He His Asp Gin Ala Lys Asp Tyr 
115 120 125 



Tyr Gin Tyr He Leu Leu Lys Ala Gin Val Gly Glu Val Ala Tyr Asp 
130 135 140 



Tyr Leu Gin Asn Arg Gly He Ser Arg Glu Val Met Glu Glu Phe Glu 
145 150 155 160 



Leu Gly Tyr Ser Pro Ser Gin Arg Glu Ser Leu His Leu Tyr Leu Gin 

165 170 175 
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Ser Gin Asp Gin Ala Asp Leu Thr Asp Asp Leu Leu Glu Glu Thr Gly 

180 185 190 



Leu Phe Ser Lys Arg Glu Val Glu Ser Asp Ser Phe Lys Asp Arg Phe 
195 200 205 



Ala Lys Arg lie lie Phe Pro Leu Lys Asn Leu Gin Gly Gin Thr Val 
210 215 220 



Gly Phe Ser Gly Arg Tyr Phe Gin Asp Glu Pro Asn Gin Asp Phe His 
225 230 235 240 



His Ala Lys Tyr Leu Asn Ser Pro Glu Thr Lys lie Phe Asn Lys Arg 

245 250 255 



Arg Thr Leu Phe Asn Tyr His Gin Ala Lys Ala Tyr lie Arg Arg Ala 

260 265 270 



Lys Glu Val Val Leu Phe Glu Gly Tyr Met Asp Val lie Ala Ala Trp 
275 280 285 



Gin Ala Gly Val Lys Asn Gly Leu Ala Ser Met Gly Thr Ser He Thr 
290 295' 300 



Ala Asp Gin Val Gin Thr Met Gin Arg He Ala Asp Thr Leu Val Leu 
305 310 315 320 



Ala Phe Asp Gly Asp Glu Ala Gly Leu Glu Ser Ser Lys Lys He Leu 

325 330 335 



Asp Asp Leu Ser Leu Thr Ser Lys Leu Gin He Glu Val Val He Phe 

340 345 350 



Pro Lys Lys Met Asp Pro Asp Glu Tyr He Arg Glu Asn Gly Pro Glu 
355 ~ 360 365 



Ala Phe Gin Asn Leu He Gin His Gly Arg Met Thr Val Tyr Gin Phe 
370 375 380 



Leu Lys Glu Tyr Phe Lys Lys Ser Tyr Asn Leu Asp Asn Asp Ser Asp 
385 390 395 400 



Arg Leu Lys Phe He Gin Thr Met Thr Asn Lys He Gly Lys Leu Ala 
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405 410 415 



Ser Pro Leu Glu Arg Glu Val Tyr Ala Lys Asp Leu Ala Glu Glu Phe 

420 425 430 



Asn Leu Ser Tyr Asp Thr lie lie Ser Gin Val Gin Ser Glu Ala Thr 
435 440 445 



Leu Asn Gin Gin Glu Ala Leu Lys Lys Asp Arg His Lys Glu Phe Ser 
450 455 460 



Gin Ala Arg Val Glu Val Lys Ala Pro Ser Ser Gin Lys Thr Lys He 
465 470 475 480 



Asp Arg Ala Gin Glu Lys Leu Leu Asn Arg Leu Phe Tyr Tyr Pro Gin 

485 490 495 



Val Gin Glu He He Asp Ala Tyr Asn Pro Asp Phe Glu Phe Lys Thr 

500 505 510 



Glu Val His Gin Arg He Tyr Leu Leu Phe Leu Glu Tyr Ser Gin Glu 
515 520 525 



Asn Asp Ser He Asp Ser Phe He Asp Phe Val Lys Asp Lys Glu Thr 
530 535 540 



Lys Glu Val He Ser Asp He Met Trp Thr Ser He Glu Val Glu Pro 
545 550 555 560 



Ser Asp Glu Glu He Leu Asp Tyr Leu Asp Tyr He Asp Gin Thr Tyr 

565 570 575 



Pro Leu Glu Gin Lys Arg Gin Asp Cys Leu Glu Glu Val Lys Ala Ala 

580 585 590 



Lys Gin Ser Gly Asn Lys Lys Arg Glu Leu Glu Leu Thr Asn Gin Leu 
595 600 605 



He Glu He Asn Arg Met Leu Lys Gin 
610 615 



<210> 65 
<211> 987 



WO 03/104391 



138/235 



PCT/US02/36122 



<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (55) . . (987) 

<223> 

<400> 65 

gccagacaat cggacaacta ttaccggaca actttaagcc aggggacatg gaat atg 57 

Met 
1 



gaa aac aat gaa aac aat gaa aac aaa gat age aaa aca ttt aaa tea 
Glu Asn Asn Glu Asn Asn Glu Asn Lys Asp Ser Lys Thr Phe Lys Ser 

5 10 15 



150 155 160 



act tea ggc ttg gaa ggg gaa aat ate cag gag etc att caa ace ate 
Thr Ser Gly Leu Glu Gly Glu Asn He Gin Glu Leu He Gin Thr He 

165 170 175 



105 



ggt ttt gtc acc ctt ctt ggc egg ccc aat gtg ggc aag tea acc ctg 153 
Gly Phe Val Thr Leu Leu Gly Arg Pro Asn Val Gly Lys Ser Thr Leu 
20 25 30 

etc aac caa ata tta ggc cag aag att acc att ate agt gac aaa ccc 201 
Leu Asn Gin He Leu Gly Gin Lys He Thr He He Ser Asp Lys Pro 
3.5 40 45 

caa aca acc egg aat aaa ate cag ggt att tac acc gac caa gcg ggg 249 
Gin Thr Thr Arg Asn Lys He Gin Gly He Tyr Thr Asp Gin Ala Gly 
50 55 60 65 

caa att gtc ttt ate gac aca cct ggt ata cat aaa ccc aag cac cgc 297 
Gin He Val Phe He Asp Thr Pro Gly He His Lys Pro Lys His Arg 

70 75 "* 80 

ctg ggc egg ttt atg gtg gat teg get atg teg acc ate aat gag gtg 345 
Leu Gly Arg Phe Met Val Asp Ser Ala Met Ser Thr He Asn Glu Val 

85 90 95 

gac ctg gtc tta ttt gtg gtc aat gtc agg gaa aag att ggc ccg ggg 393 
Asp Leu Val Leu Phe Val Val Asn Val Arg Glu Lys He Gly Pro Gly 
100 105 110 

gac egg ttc att ate gac aag ttg cga acc ate gat acg cca gtt ttt 441 
Asp Arg Phe lie He Asp Lys Leu Arg Thr He Asp Thr Pro Val Phe 
"115 120 125 

tta att att aac cag att gac cag gtc gat cca aca gac etc eta ccg 489 
Leu He He Asn Gin He Asp Gin Val Asp Pro Thr Asp Leu Leu Pro 
130 135 140 145 

gtt att age gac tac caa gag gaa ttc gac ttt gee gaa gtg gtt cca 537 
Val He Ser Asp Tyr Gin Glu Glu Phe Asp Phe Ala Glu Val Val Pro 



585 
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aag tct tac eta cct g-tt gga ccc caa ttt tac ccg gac gac cag gtc 
Lys Ser Tyr Leu Pro Val Gly Pro Gin Phe Tyr Pro Asp Asp Gin Val 
180 185 190 

teg gac cac ccc gaa tac ttt att att tea gaa etc ate egg gag aag 
Ser Asp His Pro Glu Tyr Phe lie lie Ser Glu Leu lie Arg Glu Lys 
195 200 205 

gtt tta gac ttg get aga gaa gag att cct cat tea gta gca gta gta 
Val Leu Asp Leu Ala Arg Glu Glu He Pro His Ser Val Ala Val Val 
210 215 220 225 

act gag aag gta gac cga aac caa gat ggt aaa gtc caa ace tat gec 
Thr Glu Lys Val Asp Arg Asn Gin Asp Gly Lys Val Gin Thr Tyr Ala 

230 235 240 

acc att att gtc gaa cgc aag age caa aag ggg att att ate ggc aag 
Thr He He Val Glu Arg Lys Ser Gin Lys Gly He He He Gly Lys 

245 250 255 

caa ggg tec atg att aaa aaa att ggt age eta get egg cga gat att 
Gin Gly Ser Met He Lys Lys He Gly Ser Leu Ala Arg Arg Asp He 
260 265 270 

gag aaa eta ctg gga gat aag att tac ttg gaa etc tgg gtt aaa gtc 
Glu Lys Leu Leu Gly Asp Lys He Tyr Leu Glu Leu Trp Val Lys Val 
275 280 285 

caa aga gac tgg egg gac aag ccc agt cgc tta gaa gac ttt ggc tac 
Gin Arg Asp Trp Arg Asp Lys Pro Ser Arg Leu Glu Asp Phe Gly Tyr 
290 295 300 305 

aat gaa gac aac tat tag 
Asn Glu Asp Asn Tyr 

310 



<210> 66 
<211> 310 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 66 

Met Glu Asn Asn Glu Asn Asn Glu Asn Lys Asp Ser Lys Thr Phe Lys 
1 5 10 15 

Ser Gly Phe Val Thr Leu Leu Gly Arg Pro Asn Val Gly Lys Ser Thr 

20 25 30 



Leu Leu Asn Gin He Leu Gly Gin Lys He Thr He He Ser Asp Lys 
35 40 45 



Pro Gin Thr Thr Arg Asn Lys He Gin Gly He Tyr Thr Asp Gin Ala 
50 55 60 
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Gly Gin lie Val Phe lie Asp Thr Pro Gly He His Lys Pro Lys His 
65 70 75 80 



Arg Leu Gly Arg Phe Met Val Asp Ser Ala Met Ser Thr He Asn Glu 

85 90 95 



Val Asp Leu Val Leu Phe Val Val Asn Val Arg Glu Lys lie Gly Pro 

100 105 110 



Gly Asp Arg Phe He He Asp Lys Leu Arg Thr He Asp Thr Pro Val 
115 120 125 



Phe Leu He He Asn Gin He Asp Gin Val Asp Pro Thr Asp Leu Leu 
130 135 140 



Pro Val He Ser Asp Tyr Gin Glu Glu Phe Asp Phe Ala Glu Val Val 
145 150 155 160 



Pro Thr Ser Gly Leu Glu Gly Glu Asn He Gin Glu Leu He Gin Thr 

165 170 175 



He Lys Ser Tyr Leu Pro Val Gly Pro Gin Phe Tyr Pro Asp Asp Gin 

180 185 190 



Val Ser Asp His Pro Glu Tyr Phe He He Ser Glu Leu He Arg Glu 
195 200 205 



Lys Val Leu Asp Leu Ala Arg Glu Glu He Pro His Ser Val Ala Val 
210 215 220 



Val Thr Glu Lys Val Asp Arg Asn Gin Asp Gly Lys Val Gin Thr Tyr 
225 230 235 240 



Ala Thr He He Val Glu Arg Lys Ser Gin Lys Gly He He He Gly 

245 250 255 



Lys Gin Gly Ser Met He Lys Lys He Gly Ser Leu Ala Arg Arg Asp 

260 265 270 



He Glu Lys Leu Leu Gly Asp Lys He Tyr Leu Glu Leu Trp Val Lys 
275 280 285 
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Val Gin Arg Asp Trp Arg Asp Lys Pro Ser Arg Leu Glu Asp Phe Gly 
290 295 300 



Tyr Asn Glu Asp Asn Tyr 
305 310 



<210> 67 
<211> 1557 
<212> DMA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (46) . . (1557) 

<223> 

<400> 67 

catgtctatg ttactttaat cagtggaaaa caagaggaga tcatt gtg att tec tct 57 

Met He Ser Ser 
1 

ttc tat tta gta gga gtc ttg aga ttg agt agt gaa aat aaa tta acc 105 
Phe Tyr Leu Val Gly Val Leu Arg Leu Ser Ser Glu Asn Lys Leu Thr 
5 10 15 20 

ttc aaa cac ttc ctt gca aac cag ttg acc aaa cga gac aat tta caa 153 
Phe Lys His Phe Leu Ala Asn Gin Leu Thr Lys Arg Asp Asn Leu Gin 

25 30 " 35 

ate ccc cgt tgg caa att ttt gcc gtt tta ttt aca gga gcc gtg att 201 
He Pro Arg Trp Gin He Phe Ala Val Leu Phe Thr Gly Ala Val He 

40 45 50 

gtg gtt etc aac caa acg gcc atg tct acc gcc ttg cct aat atg att 249 
Val Val Leu Asn Gin Thr Ala Met Ser Thr Ala Leu Pro Asn Met He 
55 60 65 

gaa agt ttg ggc att gac cct age eta ggc cag tgg att gtc teg ggt 297 
Glu Ser Leu Gly He Asp Pro Ser Leu Gly Gin Trp He Val Ser Gly 
70 75 80 

tat acc ttg gtc aaa ggg att atg gtc ccc ata acc gcc ttt gcc atg 345 
Tyr Thr Leu Val Lys Gly He Met Val Pro He Thr Ala Phe Ala Met 
85 90 95 100 

acc aag tac egg aca egg aac ttt ttt att tta atg ttg gcc etc ttc 393 
Thr Lys Tyr Arg Thr Arg Asn Phe Phe He Leu Met Leu Ala Leu Phe 

105 HO 115 

tgt acc ggt agt ttt ttg act ggt ctg ggc ttt aat ttt ccg gtt gtg 441 
Cys Thr Gly Ser Phe Leu Thr Gly Leu Gly Phe Asn Phe Pro Val Val 

120 125 ' 130 



gtc atg ggg aca gtc ate cag ggt ata gcg get ggg atg ate ate ccc 



489 
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Val Met Gly Thr Val lie Gin Gly lie Ala Ala Gly Met He He Pro 

135 140 145 

ttg atg cag acc gtc etc ttg acc ttg atg ccg gtt gaa age cga ggc 537 

Leu Met Gin Thr Val Leu Leu Thr Leu Met Pro Val Glu Ser Arg Gly 

150 155 160 

act get atg ggg gta atg agt ggg gtt att ggt att ggt cca gca ctg 585 

Thr Ala Met Gly Val Met Ser Gly Val He Gly He Gly Pro Ala Leu 

165 170 175 180 

ggt ccc ctt gtc ggt ggg gtc att gtt gat get ttc acc tgg gaa att 633 

Gly Pro Leu Val Gly Gly Val He Val Asp Ala Phe Thr Trp Glu He 

185 190 195 

tta ttc tac ate tgg gee tta ate acc ctt tta ttg gtt cct tta act 681 

Leu Phe Tyr He Trp Ala Leu He Thr Leu Leu Leu Val Pro Leu Thr 

200 205 210 

tgg ctg gtc tta ccc gat gta ttg cca aat gca gat tta acc att aat 729 

Trp Leu Val Leu Pro Asp Val Leu Pro Asn Ala Asp Leu Thr He Asn 

215 220 225 

tgg gec aat ate egg gac tec etc att ggt ttt ggc etc etc etc ttt 777 
Trp Ala Asn He Arg Asp Ser Leu He Gly Phe Gly Leu Leu Leu Phe 

230 235 240 

age ttg tea gtc ttt ggt tct tec ggt ttt tct teg gtc att gec tgg 825 

Ser Leu Ser Val Phe Gly Ser Ser Gly Phe Ser Ser Val He Ala Trp 

245 250 255 260 

gtc age ttg ctt ate ggt tta gtc ttt gtc gee aag ttt ate cac ttc 873 

Val Ser Leu Leu He Gly Leu Val Phe Val Ala Lys Phe He His Phe 

265 270 275 

aac etc aag gca gac caa cca ate tta aat ctt aga etc ttt aaa aaa . 921 

Asn Leu Lys Ala Asp Gin Pro He Leu Asn Leu Arg Leu Phe Lys Lys 

280 285 290 

acc tat tac cgt egg get gtc ttg gta gee acc ttg ggg att gtc att 969 

Thr Tyr Tyr Arg Arg Ala Val Leu Val Ala Thr Leu Gly He Val He 

295 300 . 305 

att tct tgt eta tec aac att ate cct att tat gtt caa act gtt agg 1017 

He Ser Cys Leu Ser Asn He He Pro He Tyr Val Gin Thr Val Arg 

310 315 320 

ggc ttg ggg get tec ata gca ggc tta ate tta atg cca get ggt ate 1065 

Gly Leu Gly Ala Ser He Ala Gly Leu He Leu Met Pro Ala Gly He 

325 330 335 340 

ate aaa acc ate tta get cct ate tea ggc aaa ctt tat gac aag gtt 1113 

He Lys Thr He Leu Ala Pro He Ser Gly Lys Leu Tyr Asp Lys Val 

345 350 355 



gga gtg get egg att ggc ctt ate ggt ggt ate tta ctt tta gtt ggg 
Gly Val Ala Arg He Gly Leu He Gly Gly He Leu Leu Leu Val Gly 



1161 
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360 365 370 

tec tta tta eta gtt acc etc aat gaa get age tec ctt tac tta ctg 1209 
Ser Leu Leu Leu Val Thr Leu Asn Glu Ala Ser Ser Leu Tyr Leu Leu 
375 380 385 

atg att tac tac ggc ate tta tea gec ggt ttt ggc ttg ttt aat ate 1257 
Met lie Tyr Tyr Gly lie Leu Ser Ala Gly Phe Gly Leu Phe Asn lie 
390 395 400 

cct att acc act get ggc atg aat att atg gec aag gaa gat atg gga 1305 
Pro lie Thr Thr Ala Gly Met Asn lie Met Ala Lys Glu Asp Met Gly 
405 410 415 420 

cat gcg act tea gec egg caa acg gtc egg caa ate tct tea agt ttt 1353 
His Ala Thr Ser Ala Arg Gin Thr Val Arg Gin lie Ser Ser Ser Phe 

425 430 435 

gec gtt tec etc tec ttt ate ate atg acc ctg gtg act att gee act 1401 
Ala Val Ser Leu Ser Phe lie lie Met Thr Leu Val- Thr lie Ala Thr 

440 445 450 

tec ggc caa teg gtg ggg gtt ttc caa gat ggc ggt ccg aca gac tta 1449 
Ser Gly Gin Ser Val Gly Val Phe Gin Asp Gly Gly Pro Thr Asp Leu 
455 460 465 

aat atg gca gga gtc cga ggc gec ttt ate ttg gtg get ata ttt tea 1497 
Asn Met Ala Gly Val Arg Gly Ala Phe lie Leu Val Ala He Phe Ser 
470 475 480 

ate eta gee atg ate ttg ate ttc ttt tta aaa gac cct aaa gaa aaa 1545 
He Leu Ala Met He Leu He Phe Phe Leu Lys Asp Pro Lys Glu Lys 
485 490 495 500 

cca gac caa tag 1557 
Pro Asp Gin 

<210> 68 
<211> 503 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 68 

Met He Ser Ser Phe Tyr Leu Val Gly Val Leu Arg Leu Ser Ser Glu 
15 10 15 

Asn Lys Leu Thr Phe Lys His Phe Leu Ala Asn Gin Leu Thr Lys Arg 

20 25 30 

Asp Asn Leu Gin He Pro Arg Trp Gin He Phe Ala Val Leu Phe Thr 
35 40 45 



Gly Ala Val He Val Val Leu Asn Gin Thr Ala Met Ser Thr Ala Leu 
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50 55 60 



Pro Asn Met lie Glu Ser Leu Gly lie Asp Pro Ser Leu Gly Gin Trp 
65 70 75 80 



lie Val Ser Gly Tyr Thr Leu Val Lys Gly He Met Val Pro He Thr 

85 90 95 



Ala Phe Ala Met Thr Lys Tyr Arg Thr Arg Asn Phe Phe lie Leu Met 

100 105 110 



Leu Ala Leu Phe Cys Thr Gly Ser Phe Leu Thr Gly Leu Gly Phe Asn 
115 120 125 



Phe Pro Val Val Val Met Gly Thr Val He Gin Gly He Ala Ala Gly 
130 135 140 



Met He He Pro Leu Met Gin Thr Val Leu Leu Thr Leu Met Pro Val 
145 150 155 160 



Glu Ser Arg Gly Thr Ala Met Gly Val Met Ser Gly Val He Gly He 

165 170 175 



Gly Pro Ala Leu Gly Pro Leu Val Gly Gly Val He Val Asp Ala Phe 

180 185 190 



Thr Trp Glu He Leu Phe Tyr He Trp Ala Leu He Thr Leu Leu Leu 
195 200 205 



Val Pro Leu Thr Trp. Leu Val Leu Pro Asp Val Leu Pro Asn Ala Asp 
210 215 220 



Leu Thr He Asn Trp Ala Asn He Arg Asp Ser Leu He Gly Phe Gly 
225 230 235 240 



Leu Leu Leu Phe Ser Leu Ser Val Phe Gly Ser Ser Gly Phe Ser Ser 

245 250 255 



Val He Ala Trp Val Ser Leu Leu He Gly Leu Val Phe Val Ala Lys 

260 265 270 



Phe He His Phe Asn Leu Lys Ala Asp Gin Pro He Leu Asn Leu Arg 
275 280 285 
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Leu Phe Lys Lys Thr Tyr Tyr Arg Arg Ala Val Leu Val Ala Thr Leu 
290 295 300 



Gly lie Val lie lie Ser Cys Leu Ser Asn lie lie Pro lie Tyr Val 
305 310 315 320 



Gin Thr Val Arg Gly Leu Gly Ala Ser lie Ala Gly Leu He Leu Met 

325 330 335 



Pro Ala Gly He He Lys Thr lie Leu Ala Pro He Ser Gly Lys Leu 

340 345 350 



Tyr Asp Lys Val Gly Val Ala Arg He Gly Leu He Gly Gly He Leu 
355 360 365 



Leu Leu Val Gly Ser Leu Leu Leu Val Thr Leu Asn Glu Ala Ser Ser 
370 375 380 



Leu Tyr Leu Leu Met He Tyr Tyr Gly He Leu Ser Ala Gly Phe Gly 
385 390 "* 395 400 



Leu Phe Asn He Pro He Thr Thr Ala Gly Met Asn He Met Ala Lys 

405 410 415 



Glu Asp Met Gly His Ala Thr Ser Ala Arg Gin Thr Val Arg Gin He 

420 425 430 



Ser Ser Ser Phe Ala Val Ser Leu Ser Phe He He Met Thr Leu Val 
435 440 445 



Thr He Ala Thr Ser Gly Gin Ser Val Gly Val Phe Gin Asp Gly Gly 
450 455 460 



Pro Thr Asp Leu Asn Met Ala Gly Val Arg Gly Ala Phe He Leu Val 
465 470 475 480 



Ala He Phe Ser He Leu Ala Met He Leu He Phe Phe Leu Lys Asp 

485 490 495 



Pro Lys Glu Lys Pro Asp Gin 

500 
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<210> 69 
<211> 4392 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (58) . . (4392) 

<223> 

<400> 69 

aagggttcag gactttcgta tctggccctt tttggctcat tagaaagcag ggcaaag 



atg tea etc aat caa aaa gaa atg tat caa gta ttg atg cag caa gtc 
Met Ser Leu Asn Gin Lys Glu Met Tyr Gin Val Leu Met Gin Gin Val 
1 5 10 15 

cac tta gaa gaa cac eta caa gac cga ccc ctt ctt aaa gec ggc agt 
His Leu Glu Glu His Leu Gin Asp Arg Pro Leu Leu Lys Ala Gly Ser 

20 25 30 , 

ttg aag caa att gtt gtt tac aag get caa caa gec tgg gac ctg acc 
Leu Lys Gin lie Val Val Tyr Lys Ala Gin Gin Ala Trp Asp Leu Thr 
35 40 45 

etc caa ttt cct cag ate etc cct ttt aag gac ttc caa gtt ttg gag 
Leu Gin Phe Pro Gin lie Leu Pro Phe Lys Asp Phe Gin Val Leu Glu 
50 55 60 

tct gee etc ttg cag cat ate cca gaa gtc aac cag ate cat tta agg 
Ser Ala Leu Leu Gin His lie Pro Glu Val Asn Gin He His Leu Arg 
65 70 75 80 

gtt gat gee caa gat gac agt ttt gac cag gac etc etc cag gac tat 
Val Asp Ala Gin Asp Asp Ser Phe Asp Gin Asp Leu Leu Gin Asp Tyr 

85 90 95 

tgg cct aag gcg gtg aag ttt age gga gtc gat tct ccc ctt tgc aat 
Trp Pro Lys Ala Val Lys Phe Ser Gly Val Asp Ser Pro Leu Cys Asn 

100 105 110 

gac tta eta gac aag acc etc cct tat eta gat ggg aag caa gtt tac 
Asp Leu Leu Asp Lys Thr Leu Pro Tyr Leu Asp Gly Lys Gin Val Tyr 
115 120 125 

ttt gac ctg gac cat gaa gtg acc egg gac aag ttt gac cat gac ttc 
Phe Asp Leu Asp His Glu Val Thr Arg Asp Lys Phe Asp His Asp Phe 
130 135 140 

eta cct egg ate caa get ggc tac cag caa gtg ggc ttt ccc aac cac 
Leu Pro Arg He Gin Ala Gly Tyr Gin Gin Val Gly Phe Pro Asn His 
145 150 155 160 



ttt aaa ate aag get agg gtc gat gec cag aaa aat tea gat caa att 
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Phe Lys lie Lys Ala Arg Val Asp Ala Gin Lys Asn Ser Asp Gin lie 

165 170 175 

gcc gcc ttc cgt aaa gaa aaa gaa gaa aaa gac cag gcc ttg tct caa 633 
Ala Ala Phe Arg Lys Glu Lys Glu Glu Lys Asp Gin Ala Leu Ser Gin 

180 185 190 



gag eta acc aac caa ttt ate aag gcc age caa aag aaa gaa gaa ggg 
Glu Leu Thr Asn Gin Phe lie Lys Ala Ser Gin Lys Lys Glu Glu Gly 
195 200 205 



cgt ctg acc ttt gaa gga tac gtt ttt gat gtg gaa ate aaa tec etc 
Arg Leu Thr Phe Glu Gly Tyr Val Phe Asp Val Glu He Lys Ser Leu 

245 250 255 



tec ttc eta ttc aaa aaa ttc tct aat aat tct tct gac gaa gcc eta 
Ser Phe Leu Phe Lys Lys Phe Ser Asn Asn Ser Ser Asp Glu Ala Leu 
275 280 285 



681 



gga tec aaa gcc aag teg gag gcc ttg aag atg ggc egg gcc ate cct 729 
Gly Ser Lys Ala Lys Ser Glu Ala Leu Lys Met Gly Arg Ala He Pro 
210 215 220 

gac cac gaa acg att acc cag atg gtt gat gtg gaa gaa gaa gag age 777 
Asp His Glu Thr He Thr Gin Met Val Asp Val Glu Glu Glu Glu Ser 
225 230 235 240 



825 



egg tea gat aga aag etc ctt etc ttt aaa atg acc gac tat age tct 873 
Arg Ser Asp Arg Lys Leu Leu Leu Phe Lys Met Thr Asp Tyr Ser Ser 

260 265 270 



921 



ttt gac caa gtc caa gag gga atg tgg 
Phe Asp Gin Val Gin Glu Gly Met Trp 
290 295 

caa gaa gat acc ttt gtc aaa gac eta 
Gin Glu Asp Thr Phe Val Lys Asp Leu 
305 310 

caa gag gtc aaa aaa gaa ccc egg egg 
Gin Glu Val Lys Lys Glu Pro Arg Arg 

325 



etc aag gtt aga ggc agt gtt 969 
Leu Lys Val Arg Gly Ser Val 
300 

gtt gtc atg gcc caa gac ate 1017 
Val Val Met Ala Gin Asp He 
315 320 

gac ctg get aag gaa ggg gag . 1065 
Asp Leu Ala Lys Glu Gly Glu 
330 335 



aag agg gtg gaa ctt cat 
Lys Arg Val Glu Leu His 

340 

ttg gtg ccg gcc aag gat 
Leu Val Pro Ala Lys Asp 
355 

ccg get att gcc ate act 
Pro Ala He Ala He Thr 
370 

gcc cat tat get ggc tta 
Ala His Tyr Ala Gly Leu 



gcc cat acc acc atg agt 
Ala His Thr Thr Met Ser 
345 

ttg gtc aag caa gca gcc 
Leu Val Lys Gin Ala Ala 
360 

gat cat get gta gtc caa 
Asp His Ala Val Val Gin 
375 380 

gac act ggt gtt aaa att 
Asp Thr Gly Val Lys He 



cag atg gac ggt 1113 
Gin Met Asp Gly 
350 

get ttt gac caa 1161 

Ala Phe Asp Gin 

365 

tec ttc cca gag 1209 
Ser Phe Pro Glu 



ctt tac ggt gtg 1257 
Leu Tyr Gly Val 
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385 390 395 400 

gaa gcc aat ttg gtt agt gat ggc gaa ttg gta gca tac aat ccg gcc 1305 
Glu Ala Asn Leu Val Ser Asp Gly Glu Leu Val Ala Tyr Asn Pro Ala 

405 410 415 

gat ata aag ctg gaa gag gca act tat gtg gtc ttc gac gtg gaa aca 1353 
Asp lie Lys Leu Glu Glu Ala Thr Tyr Val Val Phe Asp Val Glu Thr 

420 425 430 

acc gga eta teg get cgt tat gac caa ate att gaa ttg gcc get gtg 14.01 
Thr Gly Leu Ser Ala Arg Tyr Asp Gin lie lie Glu Leu Ala Ala Val 
435 440 445 

aag atg gaa aat ggg gaa ate gtt tct gaa ttc caa gaa ttt att gac 1449 
Lys Met Glu Asn Gly Glu lie Val Ser Glu Phe Gin Glu Phe lie Asp 
450 455 460 

cca ggc cag ccc ttg tct gag act acg acc aat ttg acc ggg ate acc 1497 
Pro Gly Gin Pro Leu Ser Glu Thr Thr Thr Asn Leu Thr Gly He Thr 
465 470 475 480 

gat gac atg gtc caa gga tec aaa agt gaa gac gaa gtc etc cat gcc 1545 
Asp Asp Met Val Gin Gly Ser Lys Ser Glu Asp Glu Val Leu His Ala 

485 490 495 

ttt caa gcc ttt tea gaa ggc act gtc ttg gtc gcc cat aac get tec 1593 
Phe Gin Ala Phe Ser Glu Gly Thr Val Leu Val Ala His Asn Ala Ser 

500 505 510 

ttt gac atg ggc ttt ate aat acg gcc tac caa cga cat ggc eta gga 1641 
Phe Asp Met Gly Phe He Asn Thr Ala Tyr Gin Arg His Gly Leu Gly 
515 520 525 

caa get gac cag cct gtg att gat acc ttg gaa ttg tec cgc atg etc 1689 
Gin Ala Asp Gin Pro Val He Asp Thr Leu Glu Leu Ser Arg Met Leu 
530 535 540 

cac cca aac ttg aaa age cac egg tta aac act ctg get aag egg tat 1737 
His Pro Asn Leu Lys Ser His Arg Leu Asn Thr Leu Ala Lys Arg Tyr 
545 550 555 560 

gac gtg gcc tta gaa cac cac cac egg gcc ate tat gac teg gag tea 1785 
Asp Val Ala Leu Glu His His His Arg Ala He Tyr Asp Ser Glu Ser 

565 570 575 

acg get aaa etc ttg tgg ate ttc tta aaa gaa gcc aaa gac caa tat 1833 
Thr Ala Lys Leu Leu Trp He Phe Leu Lys Glu Ala Lys Asp Gin Tyr 

580 585 590 

gac atg act age cac caa gac ttg aat age cag gtg ggg gaa ggc gag 1881 
Asp Met Thr Ser His Gin Asp Leu Asn Ser Gin Val Gly Glu Gly Glu 
595 600 605 

get tac aag cag gcc egg cca acc cat gcc agt att ttg gtc aag aat 1929 
Ala Tyr Lys Gin Ala Arg Pro Thr His Ala Ser He Leu Val Lys Asn 
610 615 620 
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caa aaa ggc ttg aaa aac etc ttt aaa att gtc tec cac gee cat gtc 1977 
Gin Lys Gly Leu Lys Asn Leu Phe Lys lie Val Ser His Ala His Val 
625 630 635 640 

aac tac ttc tac egg gtt ccc cgt ata cct aag tct ate ttg age aag 2025 
Asn Tyr Phe Tyr Arg Val Pro Arg lie Pro Lys Ser lie Leu Ser Lys 

645 650 655 

tac egg gaa ggc ctt ttg gtt ggg tct ggt tgc gga cag gga gag etc 2073 
Tyr Arg Glu Gly Leu Leu Val Gly Ser Gly Cys Gly Gin Gly Glu Leu 

660 665 670 

ttt gag get att atg caa aag ggc tat gac gaa gee ttg gca gtt gee 2121 
Phe Glu Ala He Met Gin Lys Gly Tyr Asp Glu Ala Leu Ala Val Ala 
675 680 685 

cag gac tat gat tat att gaa gtt atg ccc aag tea gec tat att gac 2169 
Gin Asp Tyr Asp Tyr He Glu Val Met Pro Lys Ser Ala Tyr He Asp 
690 695 700 

etc ttg gac egg gac tta ate aag gat gag gca ace ctt gaa gaa atg 2217 
Leu Leu Asp Arg Asp Leu He Lys Asp Glu Ala Thr Leu Glu Glu Met 
705 710 715 720 

att gaa aac ctg gtt aaa ata ggc cat gaa ctt gat ata ccc gtg gta 2265 
He Glu Asn Leu Val Lys He Gly His Glu Leu Asp He Pro Val Val 

725 730 735 

get aca ggg aat gtc cac tac eta aac cca gaa gat gee gtt tta egg 2313 
Ala Thr Gly Asn Val His Tyr Leu Asn Pro Glu Asp Ala Val Leu Arg 

740 745 750 

gat ate etc ctg gaa act gee aaa aag gga gec ttc tec aaa gee egg 2361 
Asp He Leu Leu Glu Thr Ala Lys Lys Gly Ala Phe Ser Lys Ala Arg 
755 760 765 

aac cca gaa gtc cac ttt aga aca aca gat gaa atg tta gaa gag ttt 2409 
Asn Pro Glu Val His Phe Arg Thr Thr Asp Glu Met Leu Glu Glu Phe 
770 775 780 

tec ttc eta ggc cag gac cag get tat gag att gtg gtc ace aac ace 2457 
Ser Phe Leu Gly Gin Asp Gin Ala Tyr Glu He Val Val Thr Asn Thr 
785 790 795 800 

caa aaa att get gat tct ate gaa tea ate tct cct gtc aag gaa ggc 2505 
Gin Lys He Ala Asp Ser He Glu Ser He Ser Pro Val Lys Glu Gly 

805 810 815 

etc tat gec ccg aaa atg gaa ggg teg gac caa gag ata cgt cag atg 2553 
Leu Tyr Ala Pro Lys Met Glu Gly Ser Asp Gin Glu He Arg Gin Met 

820 825 830 

agt tac aag caa gee aag get etc tat ggc gac ccc ttg cca agt att 2601 
Ser Tyr Lys Gin Ala Lys Ala Leu Tyr Gly Asp Pro Leu Pro Ser He 
835 840 845 
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gta gag gaa agg etc gaa aaa gag ttg aag agt att att gac aac aat 2649 
Val Glu Glu Arg Leu Glu Lys Glu Leu Lys Ser He He Asp Asn Asn 
850 855 860 

ttc tct gtc att tac tta att tec cag aaa ttg gtc aaa aaa agt gtt 2697 
Phe Ser Val He Tyr Leu He Ser Gin Lys Leu Val Lys Lys Ser Val 
865 870 875 880 

gaa gat ggc tat ttg gtt ggt tec agg ggg teg gtt ggg tea age ttt 2745 
Glu Asp Gly Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe 

885 890 895 

gtg gec acc atg acc ggg ate aca gaa gtc aac cca eta ccg ccc cac 2793 
Val Ala Thr Met Thr Gly He Thr Glu Val Asn Pro Leu Pro Pro His 

900 905 910 

tac cgc tgt cct aac tgc cag cac acc gaa ttc ttc aca aat ggg gaa 2841 
Tyr Arg Cys Pro Asn Cys Gin His Thr Glu Phe Phe Thr Asn Gly Glu 
915 920 925 

gtg ggg tec ggc ttt gac tta gag gee aaa aaa tgt ccg gaa tgt caa 2889 
Val Gly Ser Gly Phe Asp Leu Glu Ala Lys Lys Cys Pro Glu Cys Gin 
930 935 940 

age eta atg gaa tea gac ggc cac gac att ccc ttc gaa acc ttc ctt 2937 
Ser Leu Met Glu Ser Asp Gly His Asp He Pro Phe Glu Thr Phe Leu 
945 950 955 960 

ggt ttt aat ggg gac aag gtg cca gat ate gat ttg aac ttc tea ggt 2985 
Gly Phe Asn Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly 

965 970 975 

gaa tac cag gee aag gee cac aac tat acc aag gtt ttg ttt gga gaa 3033 
Glu Tyr Gin Ala Lys Ala His Asn Tyr Thr Lys Val Leu Phe Gly Glu 

980 985 990 

gac cat gtc tac egg gca ggg acc ate acg acg att get gac aag acg 3 081 

Asp His Val Tyr Arg Ala Gly Thr He Thr Thr He Ala Asp Lys Thr 
995 1000 1005 

gee ttt ggt ttt gtc aag ggt tat gaa agg gac aag cag ata aac tac 3129 
Ala Phe Gly Phe Val Lys Gly Tyr Glu Arg Asp Lys Gin lie Asn Tyr 
1010 1015 1020 

egg teg get gaa gtg gac egg ctg tea gat ggt tta acc gga gtg aga 3177 
Arg Ser Ala Glu Val Asp Arg Leu Ser Asp Gly Leu Thr Gly Val Arg 
1025 1030 1035 1040 

egg tea acc ggc cag cac cca gga ggg att ate gtc ata ccg gat gac 3225 
Arg Ser Thr Gly Gin His Pro Gly Gly He He Val He Pro Asp Asp 

1045 1050 1055 

atg gat gtg ttt gat ttc acc ccc ate cag tac ccg get gac gac cag 3273 
Met Asp Val Phe Asp Phe Thr Pro He Gin Tyr Pro Ala Asp Asp Gin 

1060 1065 1070 



acg get gag tgg caa act acc cac ttt gac ttc cac tec ate gac gaa 



3321 
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Thr Ala Glu Trp Gin Thr Thr His Phe Asp Phe His Ser He Asp Glu 
1°75 1080 1085 

aac gtc ttg aag ctg gat ate ctg gga cat gat gac ccg acc atg ate 3369 
Asn Val Leu Lys Leu Asp He Leu Gly His Asp Asp Pro Thr Met He 
1090 1095 1100 

cga aaa etc cag gac ttg tec ggc ttt gac cct caa gaa ata ccg gta 3417 
Arg Lys Leu Gin Asp Leu Ser Gly Phe Asp Pro Gin Glu He Pro Val 
1105 mo U15 1120 

agt gat gaa gat gtt atg aaa att ttc tea ggc ccg gaa gtt eta ggg 3465 
Ser Asp Glu Asp Val Met Lys He Phe Ser Gly Pro Glu Val Leu Gly 

1125 1130 1135 

gtg acc cca gag caa att ttc tec aat acc gga act etc gga gta cct 3 513 

Val Thr Pro Glu Gin He Phe Ser Asn Thr Gly Thr Leu Gly Val Pro 

II 40 1145 1150 

gaa ttt ggt acc caa ttt gtc cga . gaa atg tta gag caa acc cac ccc 3561 
Glu Phe Gly Thr Gin Phe Val Arg Glu Met Leu Glu Gin Thr His Pro 
1155 1160 1165 

tct acc ttt get gaa etc ttg cag ate tea ggc etc tec cac ggg aca 3609 
Ser Thr Phe Ala Glu Leu Leu Gin He Ser Gly Leu Ser His Gly Thr 
1170 1175 1180 

gat gtt tgg ctg ggc aat get gaa gaa tta att cgc aac cac aac att 3 657 

Asp Val Trp Leu Gly Asn Ala Glu Glu Leu He Arg Asn His Asn He 
1185 1190 1195 1200 

ccc ttg tec gag gtg ate ggc tgc egg gat gat ate atg gtc tac ctt 37 05 

Pro Leu Ser Glu Val He Gly Cys Arg Asp Asp He Met Val Tyr Leu 

1205 1210 1215 

caa cac caa ggt ctt gaa gac age ctg gec ttt aag att atg gaa ttt- 3753 
Gin His Gin Gly Leu Glu Asp Ser Leu Ala Phe Lys He Met Glu Phe 

I 220 1225 1230 

gtt cgt aag ggt egg ggc ttg caa gat gac tgg att get acc atg aaa 3 801 

Val Arg Lys Gly Arg Gly Leu Gin Asp Asp Trp He Ala Thr Met Lys 
1235 1240 1245 

gaa aat gat gtt cct gat tgg tat att gaa tec tgc aaa aaa ate aag 3 849 

Glu Asn Asp Val Pro Asp Trp Tyr He Glu Ser Cys Lys Lys He Lys 
1250 1255 1260 

tac atg ttc cct aaa gee cac gca get gee tat gtc ttg atg gec ctt 3897 
Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu Met Ala Leu 
1265 1270 1275 1280 

agg gta get tac ttt aaa gtc cac tac ccc ctt tac tac tac get gec 3945 
Arg Val Ala Tyr Phe Lys Val His Tyr Pro Leu Tyr Tyr Tyr Ala Ala 

1285 1290 - 1295 

tac ttt tec ate egg get agt gat ttt gac tta att get atg gtc aag 3993 
Tyr Phfe Ser He Arg Ala Ser Asp Phe Asp Leu He Ala Met Val Lys 
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1300 



1305 



1310 



ggc aag gaa ggc att aaa ggg get atg aag gaa ate agg gac aag gaa 

Gly Lys Glu Gly lie Lys Gly Ala Met Lys Glu lie Arg Asp Lys Glu 
1315 1320 1325 

aga gaa aaa act gec aca get aag gac aaa gec ttg etc ace gtc ctt 

Arg Glu Lys Thr Ala Thr Ala Lys Asp Lys Ala Leu Leu Thr Val Leu 
1330 " 1335 1340 



4041 



4089 



gaa gta gec aat gaa atg gtt gaa egg ggt ttt gac ttc aag atg gtg 4137 

Glu Val Ala Asn Glu Met Val Glu Arg Gly Phe Asp Phe Lys Met Val 
1345 1350 1355 1360 

gac ate aac aag tec caa gee aaa gac ttt gtc ate gaa gac aat ggc 4185 

Asp lie Asn Lys Ser Glu Ala Lys Asp Phe Val lie Glu Asp Asn Gly 

1365 1370 1375 



ctt cgt get cca ttt agg gca gtc cct tec ttg ggg tec agt gee gee 

Leu Arg Ala Pro Phe Arg Ala Val Pro Ser Leu Gly Ser Ser Ala Ala 

1380 1385 1390 

cag get gtc att gat gee agg gag gac age gac ttc ttg tec aag gaa 

Gin Ala Val lie Asp Ala Arg Glu Asp Ser Asp Phe Leu Ser Lys Glu 
1395 1400 1405 



4233 



4281 



gac eta tea aaa egg ggc aag ttg teg aaa acg gtc atg gac tac ctg 
Asp Leu Ser Lys Arg Gly Lys Leu Ser Lys Thr Val Met Asp Tyr Leu 
1410 1415 1420 



4329 



gac aat aac cac gtt tta gac cac ctg ccg gac gaa aac caa ctt tec 4377 
Asp Asn Asn His Val Leu Asp His Leu Pro Asp Glu Asn Gin Leu Ser 
1425 1430 1435 1440 

etc ttt gac ttt taa 4392 
Leu Phe Asp Phe 



<210> 70 
<211> 1444 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 70 

Met Ser Leu Asn Gin Lys Glu Met Tyr Gin Val Leu Met Gin Gin Val 
15 10 15 



His Leu Glu Glu His Leu Gin Asp Arg Pro Leu Leu Lys Ala Gly Ser 

20 25 30 



Leu Lys Gin He Val Val Tyr Lys Ala Gin Gin Ala Trp Asp Leu Thr 
35 40 45 



Leu Gin Phe Pro Gin He Leu Pro Phe Lys Asp Phe Gin Val Leu Glu 
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50 55 60 



Ser Ala Leu Leu Gin His lie Pro Glu Val Asn Gin lie His Leu Arg 
65 70 75 80 



Val Asp Ala Gin Asp Asp Ser Phe Asp Gin Asp Leu Leu Gin Asp Tyr 

85 90 95 



Trp Pro Lys Ala Val Lys Phe Ser Gly Val Asp Ser Pro Leu Cys Asn 

100 105 110 



Asp Leu Leu Asp Lys Thr Leu Pro Tyr Leu Asp Gly Lys Gin Val Tyr 
115 120 125 



Phe Asp Leu Asp His Glu Val Thr Arg Asp Lys Phe Asp His Asp Phe 
130 135 140 



Leu Pro Arg lie Gin Ala Gly Tyr Gin Gin Val Gly Phe Pro Asn His 
145 150 155 160 



Phe Lys lie Lys Ala Arg Val Asp Ala Gin Lys Asn Ser Asp Gin lie 

165 170 175 



Ala Ala Phe Arg Lys Glu Lys Glu Glu Lys Asp Gin Ala Leu Ser Gin 

180 185 190 



Glu Leu Thr Asn Gin Phe lie Lys Ala Ser Gin Lys Lys Glu Glu Gly 
195 200 205 



Gly Ser Lys Ala Lys Ser Glu Ala Leu Lys Met Gly Arg Ala He Pro 
210 215 220 



Asp His Glu Thr He Thr Gin Met Val Asp Val Glu Glu Glu Glu Ser 
225 230 235 240 



Arg Leu Thr Phe Glu Gly Tyr Val Phe Asp Val Glu He Lys Ser Leu 

245 250 255 



Arg Ser Asp Arg Lys Leu Leu Leu Phe Lys Met Thr Asp Tyr Ser Ser 

260 265 " 270 



Ser Phe Leu Phe Lys Lys Phe Ser Asn Asn Ser Ser Asp Glu Ala Leu 
275 280 285 
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Phe Asp Gin Val Gin Glu Gly Met Trp Leu Lys Val Arg Gly Ser Val 
290 295 300 

Gin Glu Asp Thr Phe Val Lys Asp Leu Val Val Met Ala Gin Asp He 
305 310 315 320 

Gin Glu Val Lys Lys Glu Pro Arg Arg Asp Leu Ala Lys Glu Gly Glu 

325 330 335 

Lys Arg Val Glu Leu His Ala His Thr Thr Met Ser Gin Met Asp Gly 

340 345 3 50 

r 

Leu Val Pro Ala Lys Asp Leu Val Lys Gin Ala Ala Ala Phe Asp Gin 
355 360 365 

Pro Ala He Ala He Thr Asp His Ala Val Val Gin Ser Phe Pro Glu 
370 375 380 

Ala His Tyr Ala Gly Leu Asp Thr Gly Val Lys He Leu Tyr Gly Val 
385 390 395 400 

Glu Ala Asn Leu Val Ser Asp Gly Glu Leu Val Ala Tyr Asn Pro Ala 

405 410 415 

Asp He Lys Leu Glu Glu Ala Thr Tyr Val Val Phe Asp Val Glu Thr 

420 425 430 

Thr Gly Leu Ser Ala Arg Tyr Asp Gin He He Glu Leu Ala Ala Val 
435 440 445 

Lys Met Glu Asn Gly Glu He Val Ser Glu Phe Gin Glu Phe He Asp 
450 455 460 

Pro Gly Gin Pro Leu Ser Glu Thr Thr Thr Asn Leu Thr Gly He Thr 
465 470 475 480 

Asp Asp Met Val Gin Gly Ser Lys Ser Glu Asp Glu Val Leu His Ala 

485 490 495 



Phe Gin Ala Phe Ser Glu Gly Thr Val Leu Val Ala His Asn Ala Ser 

500 505 510 
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Phe Asp Met Gly Phe He Asn Thr Ala Tyr Gin Arg His Gly Leu Gly 
515 520 525 

Gin Ala Asp Gin Pro Val lie Asp Thr Leu Glu Leu Ser Arg Met Leu 
530 535 540 

His Pro Asn Leu Lys Ser His Arg Leu Asn Thr Leu Ala Lys Arg Tyr 
545 550 555 560 

Asp Val Ala Leu Glu His His His Arg Ala He Tyr Asp Ser Glu Ser 

565 570 575 

Thr Ala Lys Leu Leu Trp lie Phe Leu Lys Glu Ala Lys Asp Gin Tyr 

580 585 590 

Asp Met Thr Ser His Gin Asp Leu Asn Ser Gin Val Gly Glu Gly Glu 
595 600 605 

Ala Tyr Lys Gin Ala Arg Pro Thr His Ala Ser He Leu Val Lys Asn 
610 ^ 615 620 

Gin Lys Gly Leu Lys Asn Leu Phe Lys He Val Ser His Ala His Val 
625 630 635 640 

Asn Tyr Phe Tyr Arg Val Pro Arg He Pro Lys Ser He Leu Ser Lys 

645 650 655 

Tyr Arg Glu Gly Leu Leu Val Gly Ser Gly Cys Gly Gin Gly Glu Leu 

660 665 670 

Phe Glu Ala He Met Gin Lys Gly Tyr Asp Glu Ala Leu Ala Val Ala 
675 680 685 

Gin Asp Tyr Asp Tyr He Glu Val Met Pro Lys Ser Ala Tyr He Asp 
690 695 700 

Leu Leu Asp Arg Asp Leu He Lys Asp Glu Ala Thr Leu Glu Glu Met 
705 710 715 720 

He Glu Asn Leu Val Lys He Gly His Glu Leu Asp He Pro Val Val 

725 730 735 
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Ala Thr Gly Asn Val His Tyr Leu Asn Pro Glu Asp Ala Val Leu Arg 

740 745 750 

Asp lie Leu Leu Glu Thr Ala Lys Lys Gly Ala Phe Ser Lys Ala Arg 
755 760 765 

Asn Pro Glu Val His Phe Arg Thr Thr Asp Glu Met Leu Glu Glu Phe 
770 775 780 

Ser Phe Leu Gly Gin Asp Gin Ala Tyr Glu lie Val Val Thr Asn Thr 
785 790 795 800 

Gin Lys lie Ala Asp Ser lie Glu Ser He Ser Pro Val Lys Glu Gly 

805 810 815 

Leu Tyr Ala Pro Lys Met Glu Gly Ser Asp Gin Glu He Arg Gin Met 

820 825 830 

Ser Tyr Lys Gin Ala Lys Ala Leu Tyr Gly Asp Pro Leu Pro Ser He 
835 840 845 

Val Glu Glu Arg Leu Glu Lys Glu Leu Lys Ser He He Asp Asn Asn 
850 855 860 

Phe Ser Val He Tyr Leu He Ser Gin Lys Leu Val Lys Lys Ser Val 
865 870 875 880 

Glu Asp Gly Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe 

885 890 895 

Val Ala Thr Met Thr Gly He Thr Glu Val Asn Pro Leu Pro Pro His 

900 905 910 

Tyr Arg Cys Pro Asn Cys Gin His Thr Glu Phe Phe Thr Asn Gly Glu 
915 920 925 

Val Gly Ser Gly Phe Asp Leu Glu Ala Lys Lys Cys Pro Glu Cys Gin 
930 935 940 

Ser Leu Met Glu Ser Asp Gly His Asp He Pro Phe Glu Thr Phe Leu 
945 950 955 960 

Gly Phe Asn Gly Asp Lys Val Pro Asp He Asp Leu Asn Phe Ser Gly 
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965 970 975 



Glu Tyr Gin Ala Lys Ala His Asn Tyr Thr Lys Val Leu Phe Gly Glu 

980 985 990 



Asp His Val Tyr Arg Ala Gly Thr He Thr Thr He Ala Asp Lys Thr 
995 1000 1005 



Ala Phe Gly Phe Val Lys Gly Tyr Glu Arg Asp Lys Gin He Asn Tyr 
1010 1015 1020 

Arg Ser Ala Glu Val Asp Arg Leu Ser Asp Gly Leu Thr Gly Val Arg 
1025 1030 1035 1040 

Arg Ser Thr Gly Gin His Pro Gly Gly He He Val He Pro Asp Asp 

1045 1050 1055 



Met Asp Val Phe Asp Phe Thr Pro He Gin Tyr Pro Ala Asp Asp Gin 

1060 1065 1070 



Thr Ala Glu Trp Gin Thr Thr His Phe Asp Phe His Ser He Asp Glu 
1075 1080 1085 



Asn Val Leu Lys Leu Asp He Leu Gly His Asp Asp Pro Thr Met He 
1090 1095 1100 

Arg Lys Leu Gin Asp Leu Ser Gly Phe Asp Pro Gin Glu He Pro Val 
1105 1110 1115 1120 

Ser Asp Glu Asp Val Met Lys He Phe Ser Gly Pro Glu Val Leu Gly 

1125 1130 1135 



Val Thr Pro Glu Gin He Phe Ser Asn Thr Gly Thr Leu Gly Val Pro 

1140 1145 1150 



Glu Phe Gly Thr Gin Phe Val Arg Glu Met Leu Glu Gin Thr His Pro 
1155 1160 1165 



Ser Thr Phe Ala Glu Leu Leu Gin He Ser Gly Leu Ser His Gly Thr 
1170 1175 1180 



Asp Val Trp Leu Gly Asn Ala Glu Glu Leu He Arg Asn His Asn He 
1185 1190 1195 1200 
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Pro Leu Ser Glu Val lie Gly Cys Arg Asp Asp lie Met Val Tyr Leu 

1205 1210 1215 



Gin His Gin Gly Leu Glu Asp Ser Leu Ala Phe Lys lie Met Glu Phe 

1220 1225 1230 



Val Arg Lys Gly Arg Gly Leu Gin Asp Asp Trp lie Ala Thr Met Lys 
1235 1240 1245 



Glu Asn Asp Val Pro Asp Trp Tyr He Glu Ser Cys Lys Lys He Lys 
1250 1255 1260 



Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu Met Ala Leu 
1265 1270 1275 1280 



Arg Val Ala Tyr Phe Lys Val His Tyr Pro Leu Tyr Tyr Tyr Ala Ala 

1285 1290 1295 



Tyr Phe Ser He Arg Ala Ser Asp Phe Asp Leu He Ala Met Val Lys 

1300 1305 1310 



Gly Lys Glu Gly He Lys Gly Ala Met Lys Glu He Arg Asp Lys Glu 
1315 1320 1325 



Arg Glu Lys Thr Ala Thr Ala Lys Asp Lys Ala Leu Leu Thr Val Leu 
1330 ~ 1335 1340 



Glu Val Ala Asn Glu Met Val Glu Arg Gly Phe Asp Phe Lys Met Val 
1345 1350 1355 1360 



Asp He Asn Lys Ser Gin Ala Lys Asp Phe Val He Glu Asp Asn Gly 

1365 1370 1375 



Leu Arg Ala Pro Phe Arg Ala Val Pro Ser Leu Gly Ser Ser Ala Ala 

1380 1385 1390 



Gin Ala Val He Asp Ala Arg Glu Asp Ser Asp Phe Leu Ser Lys Glu 
1395 1400 1405 



Asp Leu Ser Lys Arg Gly Lys Leu Ser Lys Thr Val Met Asp Tyr Leu 
1410 1415 1420 
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Asp Asn Asn His Val Leu Asp His Leu Pro Asp Glu Asn Gin Leu Ser 
1425 1430 1435 1440 



Leu Phe Asp Phe 



<210> 71 
<211> 1326 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (19) . . (1326) 

<223> 

<400> 71 

aagaaaggag gactcaat atg tct atg ttt gtc gac tac acc aaa gtt aac 51 

Met Ser Met Phe Val Asp Tyr Thr Lys Val Asn 
15 10 



ctg aga gcc ggt aag ggc ggt gac gga atg gtg get ttt aga cga gaa 
Leu Arg Ala Gly Lys Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu 

15 20 25 



ccg cct gga acc att ate egg gat gcc caa agt aag get ata ctt get 
Pro Pro Gly Thr lie He Arg Asp Ala Gin Ser Lys Ala He Leu Ala 

95 100 105 

gac tta caa gaa gaa gga caa gaa gtc ttg gca gcc caa ggt ggc egg 
Asp Leu Gin Glu Glu Gly . Gin Glu Val Leu Ala Ala Gin Gly Gly Arg 
110 115 120 



99 



aag tat gag ccc aat ggt gga cca gca ggc ggc gac ggt ggc agt ggc 147 
Lys Tyr Glu Pro Asn Gly Gly Pro Ala Gly Gly Asp Gly Gly Ser Gly 
30 35 40 

ggt aac att ate ttc aag gta gat gaa ggc etc cgt acc ctg gta gac 195 
Gly Asn He He Phe Lys Val Asp Glu Gly Leu Arg Thr Leu Val Asp 
45 50 55 

ttc cgc tac aac ccc cat ttt aag gca gat agt ggc caa aat ggt atg 243 
Phe Arg Tyr Asn Pro His Phe Lys Ala Asp Ser Gly Gin Asn Gly Met 
60 65 70 75 

ccc aag ggg atg aat ggt aag aag gca gag gac ttg att ate agt gtc 291 
Pro Lys Gly Met Asn Gly Lys Lys Ala Glu Asp Leu He He Ser Val 

80 85 90 



339 



387 



gga ggt egg ggc aat aaa cgt ttt get acg cat aag aac cca gca ccc 435 
Gly Gly Arg Gly Asn Lys Arg Phe Ala Thr His Lys Asn Pro Ala Pro 
125 ~ 130 135 



tec att gcc gaa aac ggc gag ccg ggc caa gag egg gat gtc gaa ttg 
Ser He Ala Glu Asn Gly Glu Pro Gly Gin Glu Arg Asp Val Glu Leu 
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140 145 150 155 

gaa tta aaa gtc atg gcc gat gtt ggc eta gtg ggt tat cct tct gtc 531 

Glu Leu Lys Val Met Ala Asp Val Gly lieu Val Gly Tyr Pro Ser Val 

160 165 170 

ggg aaa teg ace ctt ttg teg gtt gtc tea ggc get aaa ccc aaa att 579 

Gly Lys Ser Thr Leu Leu Ser Val Val Ser Gly Ala Lys Pro Lys lie 

175 180 185 

gga gcc tat cac ttt act aca ctt gcc cct aat tta ggt gta gtg aat 627 

Gly Ala Tyr His Phe Thr Thr Leu Ala Pro Asn Leu Gly Val Val Asn 
190 195 200 

gca gtg gac ggc aag gaa ttt gtc ttg gcg gat att cct ggc tta att 67 5 

Ala Val Asp Gly Lys Glu Phe Val Leu Ala Asp lie Pro Gly Leu lie 
205 210 215 

gaa ggg get tea gaa ggg gtt ggt ttg ggg att gac ttc etc aag cat 723 

Glu Gly Ala Ser Glu Gly Val Gly Leu Gly lie Asp Phe Leu Lys His 

220 225 230 235 

att gaa aga ace cgc ate etc ctt cat gta ctt gat atg age gga atg 771 

He Glu Arg Thr Arg He Leu Leu His Val Leu Asp Met Ser Gly Met 

240 245 250 

gaa ggt cgc cat cca att gat gat ttt gac cag att aac caa gaa eta 819 

Glu Gly Arg His Pro He Asp Asp Phe Asp Gin He Asn Gin Glu Leu 

255 260 265 

aaa gac tat aat gag aaa tta ttg gac cgc aag cag gtc att gtg gcc 867 

Lys Asp Tyr Asn Glu Lys Leu Leu Asp Arg Lys Gin Val He Val Ala 
270 275 280 

aat aaa atg gac ctg ccc cag tec egg gat aat tta ate gaa ttt aaa 915 

Asn Lys Met Asp Leu Pro Gin Ser Arg Asp Asn Leu He Glu Phe Lys 
285 290 295 

gcc gag tta gac age egg gac ctt gac tat gaa ate ttt gaa gtg tea 963 

Ala Glu Leu Asp Ser Arg Asp Leu Asp Tyr Glu He Phe Glu Val Ser 

300 305 310 315 

get gcc ace cag get ggc att cag gac eta gtc ate cga eta gcc gac 1011 

Ala Ala Thr Gin Ala Gly He Gin Asp Leu Val He Arg Leu Ala Asp 

320 325 330 

tta gtc gac caa ctg gac caa gcc cca agt .tta gac cag gaa gaa act 1059 

Leu Val Asp Gin Leu Asp Gin Ala Pro Ser Leu Asp Gin Glu Glu Thr 

335 340 345 

agt gaa gcc gac caa aga gtg gtc tac aag ttt caa get gac caa gac 1107 

Ser Glu Ala Asp Gin Arg Val Val Tyr Lys Phe Gin Ala Asp Gin Asp 
350 355 360 

aaa ttt gac ctt gac cgc gac cct gaa ggg gta tgg ttg gtt tct ggt 1155 

Lys Phe Asp Leu Asp Arg Asp Pro Glu Gly Val Trp Leu Val Ser Gly 
365 370 375 
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ccc aag gtt gag cgt ttg tat gcc atg acc aat ttt gac cac gag gaa 
Pro Lys Val Glu Arg Leu Tyr Ala Met Thr Asn Phe Asp His Glu Glu 
380 " 385 390 395 



1203 



gcc att atg egg ttt tct cgc cag eta aga ggg atg gga gta gac caa 1251 
Ala He Met Arg Phe Ser Arg Gin Leu Arg Gly Met Gly Val Asp Gin 

400 405 410 

gcc tta aga gac aag ggg get cag tct ggt gac etc gtc caa gtt gaa 1299 
Ala Leu Arg Asp Lys Gly Ala Gin Ser Gly Asp Leu Val Gin Val Glu 

415 420 425 



gat ttt gtc ttt gag ttc atg gat tag 
Asp Phe Val Phe Glu Phe Met Asp 
430 435 



<210> 72 
<211> 435 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 72 

Met Ser Met Phe Val Asp Tyr Thr Lys Val Asn Leu Arg Ala Gly Lys 
15 10 15 



Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Glu Pro Asn 

20 25 30 



Gly Gly Pro Ala Gly Gly Asp Gly Gly Ser Gly Gly Asn He He Phe 
35 40 45 



Lys Val Asp Glu Gly Leu Arg Thr Leu Val Asp Phe Arg Tyr Asn Pro 
50 ^ 55 60 



His Phe Lys Ala Asp Ser Gly Gin Asn Gly Met Pro Lys Gly Met Asn 
65 ' " 70 75 80 



Gly Lys Lys Ala Glu Asp Leu He He Ser Val Pro Pro Gly Thr He 

85 90 95 



He Arg Asp Ala Gin Ser Lys Ala He Leu Ala Asp Leu Gin Glu Glu 

100 105 110 



Gly Gin Glu Val Leu Ala Ala Gin Gly Gly Arg Gly Gly Arg Gly Asn 
115 120 125 



1326 



Lys Arg Phe Ala Thr His Lys Asn Pro Ala Pro Ser He Ala Glu Asn 
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130 135 140 

Gly Glu Pro Gly Gin Glu Arg Asp Val Glu Leu Glu Leu Lys Val Met 
145 150 155 160 

Ala Asp Val Gly Leu Val Gly Tyr Pro Ser Val Gly Lys Ser Thr Leu 

165 170 175 

Leu Ser Val Val Ser Gly Ala Lys Pro Lys He Gly Ala Tyr His Phe 

180 185 190 

Thr Thr Leu Ala Pro Asn Leu' Gly Val Val Asn Ala Val Asp Gly Lys 
195 200 205 

Glu Phe Val Leu Ala Asp lie Pro Gly Leu He Glu Gly Ala Ser Glu 
210 215 220 

Glv Val Gly Leu Gly He Asp Phe Leu Lys His He Glu Arg Thr Arg 
225 230 235 240 

He Leu Leu His Val Leu Asp Met Ser Gly Met Glu Gly Arg His Pro 

245 250 255 

He Asp Asp Phe Asp Gin He Asn Gin Glu Leu Lys Asp Tyr Asn Glu 

260 " 265 270 

Lys Leu Leu Asp Arg Lys Gin Val He Val Ala Asn Lys Met Asp Leu 
275 280 285 

Pro Gin Ser Arg Asp Asn Leu He Glu Phe Lys Ala Glu Leu Asp Ser 
290 295 300 

Arg Asp Leu Asp Tyr Glu He Phe Glu Val Ser Ala Ala Thr Gin Ala 
305 310 315 320 

Gly He Gin Asp Leu Val He Arg Leu Ala Asp Leu Val Asp Gin Leu 

325 330 335 

Asp Gin Ala Pro Ser Leu Asp Gin Glu Glu Thr Ser Glu Ala Asp Gin 

340 345 350 



Arg Val Val Tyr Lys Phe Gin Ala Asp Gin Asp Lys Phe Asp Leu Asp 
355 360 365 
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Arg Asp Pro Glu Gly Val Trp Leu Val Ser Gly Pro Lys Val Glu Arg 
370 375 380 



Leu Tyr Ala Met Thr Asn Phe Asp His Glu Glu Ala lie Met Arg Phe 
385 390 395 400 



Ser Arg Gin Leu Arg Gly Met Gly Val Asp Gin Ala Leu Arg Asp Lys 

405 410 415 



Gly Ala Gin Ser Gly Asp Leu Val Gin Val Glu Asp Phe Val Phe Glu 

420 425 43 0 



Phe Met Asp 
435 



<210> 73 
<211> 1338 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (25) . . (1338) 
<223> 



y 



<400> 73 

aagagaaaga aagaaggtgt actg atg get aat cct tta gta gec ata ate 51 

Met Ala Asn Pro Leu Val Ala lie lie 
1 5 

ggc egg cct aat gtc ggc aag tea act att ttc aac egg att att gga 99 
Gly Arg Pro Asn Val Gly Lys Ser Thr lie Phe Asn Arg He He Gly 
10 15 20 25 

gac cgc tta gee att gtc cag gat gaa ccc ggg gtc ace egg gac cgt 147 
Asp Arg Leu Ala He Val Gin Asp Glu Pro Gly Val Thr Arg Asp Arg 

30 35 40 

att tat gee gat get gaa tgg ttg ggc aaa gac ttt tct gtt ata gat 195 
He Tyr Ala. Asp Ala Glu Trp Leu Gly Lys Asp Phe Ser Val He Asp 

45 50 55 

acg gga gga ate act ttt gat gat ttg ccc ttg cat gaa gaa ata aaa 243 
Thr Gly Gly He Thr Phe Asp Asp Leu Pro Leu His Glu Glu He Lys 
60 65 - 70 

gtc caa get gaa att gee att gat gaa gca gat gtc ate gtc atg gta 291 
Val Gin Ala Glu He Ala He Asp Glu Ala Asp Val He Val Met Val 
75 80 85 
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acc agt gtc aaa gag ggc att aca gac ttg gat gac cag gta gcc tta 
Thr Ser Val Lys Glu Gly He Thr Asp Leu Asp Asp Gin Val Ala Leu 
90 95 100 105 

att ttg cag cag tec aac aaa ccc gtg gtc ctt get gtt aat aaa aca 
He Leu Gin Gin Ser Asn Lys Pro Val Val Leu Ala Val Asn Lys Thr 

110 115 120 

gat aat cct gag ctt aga aat gaa ata tat gag ttt tac ggg tta ggc 
Asp Asn Pro Glu Leu Arg Asn Glu He Tyr Glu Phe Tyr Gly Leu Gly 

125 130 135 

ttg ggt gac ccc ctt ccg gta tec ggg tct cac ggc eta ggc ttt ggg 
Leu Gly Asp Pro Leu Pro Val Ser Gly Ser His Gly Leu Gly Phe Gly 
140 145 150 

gac etc tta gac gca gtg gtg gcc aac ttt cct aat gag gcc aat atg 
Asp Leu Leu Asp Ala Val Val Ala Asn Phe Pro Asn Glu Ala Asn Met 
155 160 165 

get tat gac caa gat acc att aag ttc tgc ttg att ggt cgt ccc aat 
Ala Tyr Asp Gin Asp Thr He Lys Phe Cys Leu He Gly Arg Pro Asn 
170 175 180 185 

gtt ggc aag tct age eta gtt aat get att att ggg gaa gac egg gtt 
Val Gly Lys Ser Ser Leu Val Asn Ala He He Gly Glu Asp Arg Val 

190 195 200 

ata gtc tct gaa eta gaa ggg acc acc egg gat gca att gac act ccc 
He Val Ser Glu Leu Glu Gly Thr Thr Arg Asp Ala He Asp Thr Pro 

205 210 215 



339 



ate egg cgt egg ggc aag gtc tat gaa aaa act gaa aag tat tct gtt 
He Arg Arg Arg Gly Lys Val Tyr Glu Lys Thr Glu Lys Tyr Ser Val 
235 240 245 

atg egg gca cag cga get ate gac egg tct gat gtg gtc ttg tgt gtc 
Met Arg Ala Gin Arg Ala He Asp Arg Ser Asp Val Val Leu Cys Val 
250 ~ 255 260 265 

ctg gat get gaa aca ggc att aga gac caa gat aag aag gtt ttc ggc 
Leu Asp Ala Glu Thr Gly He Arg Asp Gin Asp Lys Lys Val Phe Gly 

270 275 280 

tat get cat caa gcc ggc aag gga att att att tta gtc aat aag tgg 
Tyr Ala His Gin Ala Gly Lys Gly He He He Leu Val Asn Lys Trp 

285 290 295 

gac acg att aaa aaa gag act aac acc atg cga gac ttt gag ttg caa 
Asp Thr He Lys Lys Glu Thr Asn Thr Met Arg Asp Phe Glu Leu Gin 
300 305 310 

att cgc gac caa ttc cgc tac etc cac tat gcc cca ate ctt ttc gtc 



387 



435 



483 



531 



579 



627 



675 



ttt atg acc cag gat ggc cag gac tat gtt atg ate gat act get ggg 723 
Phe Met Thr Gin Asp Gly Gin Asp Tyr Val Met He Asp Thr Ala Gly 
220 225 230 



771 



819 



867 



915 



963 



1011 
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He Arg Asp Gin Phe Arg Tyr Leu His Tyr Ala Pro He Leu Phe Val 
315 320 325 

tea gec aag acc aag cag aga ctg gaa gtc ate ccg gaa ttg gtc gac 
Ser Ala Lys Thr Lys Gin Arg Leu Glu Val He Pro Glu Leu Val Asp 
330 335 340 345 

egg gtc tat tat aac cgc aat caa egg gtc aag tec tec etc tta aat 
Arg Val Tyr Tyr Asn Arg Asn Gin Arg Val Lys Ser Ser Leu Leu Asn 

350 355 360 



ggg aag cga etc aag gtc ttt tat gcg acc cag gta gee act aat cca 
Gly Lys Arg Leu Lys Val Phe Tyr Ala Thr Gin Val Ala Thr Asn Pro 
380 385 390 

cct act ttt gtg gtt ttt gtc aat gat cct gac etc atg cac ttc tec 
Pro Thr Phe Val Val Phe Val Asn Asp Pro Asp Leu Met His Phe Ser 
395 400 405 

tat gag cgc ttt tta gaa aat cga ttc cgc gaa age ttt gac ttc tat 
Tyr Glu Arg Phe Leu Glu Asn Arg Phe Arg Glu Ser Phe Asp Phe Tyr 
410 ~ 415 420 425 

ggc act ccg att cag ata ate cct aga gca agg aaa taa 
Gly Thr Pro He Gin He He Pro Arg Ala Arg Lys 

430 435 



1059 



1107 



gat gtg ctg agt gat gca eta gee age aat cct gca cct agt aag tea 115 5 

Asp Val Leu Ser Asp Ala Leu Ala Ser Asn Pro Ala Pro Ser Lys Ser 

365 370 375 



1203 



1251 



1299 



1338 



<210> 74 
<211> 437 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 74 

Met Ala Asn Pro Leu Val Ala lie He Gly Arg Pro Asn Val Gly Lys 
X 5 10 15 



Ser Thr He Phe Asn Arg He He Gly Asp Arg Leu Ala He Val Gin 

20 25 30 

Asp Glu Pro Gly Val Thr Arg Asp Arg He Tyr Ala Asp Ala Glu Trp 
35 40 45 

Leu Gly Lys Asp Phe Ser Val He Asp Thr Gly Gly He Thr Phe Asp 
50 55 60 



Asp Leu Pro Leu His Glu Glu He Lys Val Gin Ala Glu He Ala He 
65 70 75 80 
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Asp Glu Ala Asp Val lie Val Met Val Thr Ser Val Lys Glu Gly He 

85 90 95 



Thr Asp Leu Asp Asp Gin Val Ala Leu He Leu Gin Gin Ser Asn Lys 

100 105 HO 



Pro Val Val Leu Ala Val Asn Lys Thr Asp Asn Pro Glu Leu Arg Asn 
115 120 125 

Glu He Tyr Glu Phe Tyr Gly Leu Gly Leu Gly Asp Pro Leu Pro Val 
130 135 140 

Ser Gly Ser His Gly Leu Gly Phe Gly Asp Leu Leu Asp Ala Val Val 
145 150 155 160 

Ala Asn Phe Pro Asn Glu Ala Asn Met Ala Tyr Asp Gin Asp Thr He 

165 170 175 

Lys Phe Cys Leu He Gly Arg Pro Asn Val Gly Lys Ser Ser Leu Val 

180 185 190 

Asn Ala He He Gly Glu Asp Arg Val He Val Ser Glu Leu Glu Gly 
195 200 205 

Thr Thr Arg Asp Ala He Asp Thr Pro Phe Met Thr Gin Asp Gly Gin 
210 ~ 215 220 

Asp Tyr Val Met He Asp Thr Ala Gly He Arg Arg Arg Gly Lys Val 
225 230 235 240 

Tyr Glu Lys Thr Glu Lys Tyr Ser Val Met Arg Ala Gin Arg Ala He 

245 250 255 

Asp Arg Ser Asp Val Val Leu Cys Val Leu Asp Ala Glu Thr Gly He 

260 265 - 270 

Arg Asp Gin Asp Lys Lys Val Phe Gly Tyr Ala His Gin Ala Gly Lys 
275 280 285 



Gly lie He He Leu Val Asn Lys Trp Asp Thr He Lys Lys Glu Thr 
290 295 300 
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Asn Thr Met Arg Asp Phe Glu Leu Gin lie Arg Asp Gin Phe Arg Tyr 
305 310 315 320 

Leu His Tyr Ala Pro lie Leu Phe Val Ser Ala Lys Thr Lys Gin Arg 

325 330 335 



Leu Glu Val lie Pro Glu Leu Val Asp Arg Val Tyr Tyr Asn Arg Asn 

340 345 350 

Gin Arg Val Lys Ser Ser Leu Leu Asn Asp Val Leu Ser Asp Ala Leu 
355 ** 360 365 

Ala Ser Asn Pro Ala Pro Ser Lys Ser Gly Lys Arg Leu Lys Val Phe 
370 375 380 

Tyr Ala Thr Gin Val Ala Thr Asn Pro Pro Thr Phe Val Val Phe Val 
385 390 395 400 

* 

Asn Asp Pro Asp Leu Met His Phe Ser Tyr Glu Arg Phe Leu Glu Asn 

405 410 415 

Arg Phe Arg Glu Ser Phe Asp Phe Tyr Gly Thr Pro lie Gin He He 

420 425 430 



Pro Arg Ala Arg Lys 
435 



<210> 75 
<211> 3324 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (10).. (3324) 

<223> 

<400> 75 

aataaaaga ttg aaa caa ata tgt ctt aga cga aga ggt gac aag atg act 
Met Lys Gin He Cys Leu Arg Arg Arg Gly Asp Lys Met Thr 
1 5 10 

ttt acc cac tta caa gtg acc agt get tac acc ttg atg get teg acc 
Phe Thr His Leu Gin Val Thr Ser Ala Tyr Thr Leu Met Ala Ser Thr 
15 20 25 30 

ate caa ttg ccc etc ctg atg gac cgc ctg aag gag ctt ggc atg gag 
He Gin Leu Pro Leu Leu Met Asp Arg Leu Lys Glu Leu Gly Met Glu 



51 



99 



147 



gac gtc cgt tgc ttg gaa gaa age caa gtc tec act ttg gaa ate tta 
Asp Val Arg Cys Leu Glu Glu Ser Gin Val Ser Thr Leu Glu He Leu 

195 . 200 205 

age cac ate aaa gee aac cag aaa att caa ttt gac acc cag get egg 
Ser His lie Lys Ala Asn Gin Lys He Gin Phe Asp Thr Gin Ala Arg 

210 215 220 

gaa aat tat gee ctg cgc agt ccc caa gaa atg gag tct ttt ttt aac 
Glu Asn Tyr Ala Leu Arg Ser Pro Gin Glu Met Glu Ser Phe Phe Asn 
225 230 235 



tea gta gac tgg tec ctg gac etc ggt cag get aaa ttg cct gca ttt 
Ser Val Asp Trp Ser Leu Asp Leu Gly Gin Ala Lys Leu Pro Ala Phe 
255 260 265 270 



291 



339 
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35 40 45 

get gtt gee ttg acc gac cac aat gtt atg cat gga gcg gtc gaa ttt 195 
Ala Val Ala Leu Thr Asp His Asn Val Met His Gly Ala Val Glu Phe 

50 55 60 

tac caa gaa gee aaa aag cat ggc att aaa ccc att atg gga eta egg 243 
Tyr Gin Glu Ala Lys Lys His Gly He Lys Pro He Met Gly Leu Arg 
65 70 75 

get gac eta gac gaa gga ata acc gtc acc etc ctg get aaa aac aag 
Ala Asp Leu Asp Glu Gly He Thr Val Thr Leu Leu Ala Lys Asn Lys* 
80 85 90 

get ggc tac cag get etc tta gee tta teg act gac ctt caa gtt aac 
Ala Gly Tyr Gin Ala Leu Leu Ala Leu Ser Thr Asp Leu Gin Val Asn 
95 "* " 100 105 HO 

aag cag get att aca ctt gac caa gtc cgt tct gtg gee cag gac etc 
Lys Gin Ala He Thr Leu Asp Gin Val Arg Ser Val Ala Gin Asp Leu 

115 120 125 

tat aca ata ttc cca age tct gac cca aaa gtg aaa gca gac etc tta 
Tyr Thr He Phe Pro Ser Ser Asp Pro Lys Val Lys Ala Asp Leu Leu 

130 135 140 

gat aag cag gca age aat ttg acc gcg atg act cag aac ctg ccc cat 
Asp Lys Gin Ala Ser Asn Leu Thr Ala Met Thr Gin Asn Leu Pro His 
145 150 155 

tea tat ttg ggt ctg gtg cca gac caa gat caa aaa att tac cag tta 
Ser Tyr Leu Gly Leu Val Pro Asp Gin Asp Gin Lys He Tyr Gin Leu 
160 165 170 

gee egg acc ttg tea gat tct gga ggt ttg aaa gtc tta gec tta tct 579 
Ala Arg Thr Leu Ser Asp Ser Gly Gly Leu Lys Val Leu Ala Leu Ser 
175 ~ 180 185 190 



387 



435 



483 



531 



627 



675 



723 



cag gtg ggt tta ggt cag gee ctt aaa aat act aaa gat gta gee cag 771 
Gin Val Gly Leu Gly Gin Ala Leu Lys Asn Thr Lys Asp Val Ala Gin 
240 245 250 



819 
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gac ctg ccg gaa ggg gag acc aag gac tec tac ctt ggc aag ctt gec 

Asp Leu Pro Glu Gly Glu Thr Lys Asp Ser Tyr Leu Gly Lys Leu Ala 

275 280 285 

caa aaa gga etc caa gaa egg gtt cca ggc tac ggc caa gac tac caa 

Gin Lys Gly Leu Gin Glu Arg Val Pro Gly Tyr Gly Gin Asp Tyr Gin 

290 295 300 

gac cgt eta gac aag gaa eta gcg gtt att tct tec atg ggc ttt teg 

Asp Arg Leu Asp Lys Glu Leu Ala Val lie Ser Ser Met Gly Phe Ser 

305 310 315 



867 



aaa att gag act ggt ttt ggc egg ggg tea get gee get tct ttg gta 
Lys He Glu Thr Gly Phe Gly Arg Gly Ser Ala Ala Ala Ser Leu Val 
335 340 345 350 

tct tat gee etc tac att acg ggg gta gat ccc ate cat . tat gac etc 
Ser Tyr Ala Leu Tyr He Thr Gly Val Asp Pro He His Tyr Asp Leu 

355 360 365 

etc ttt gaa cgt ttt ttg aac aag gac cgc ttt acc atg cct gat att 
Leu Phe Glu Arg Phe Leu Asn Lys Asp Arg Phe Thr Met Pro Asp He 

370 375 380 

gac eta gac ttc cca gac aac aag cgc cag gtc ate ttg gac tat gtc 
Asp Leu Asp Phe Pro Asp Asn Lys Arg Gin Val He Leu Asp Tyr Val 
385 390 395 



acc ttt gcg get aag tec tec ate agg gaa att atg egg acc ttg ggt 
Thr Phe Ala Ala Lys Ser Ser He Arg Glu He Met Arg Thr Leu Gly 
415 420 425 430 



aaa ctg gtc cag caa age cat gaa aat gag egg ate ttt gee atg gee 
Lys Leu Val Gin Gin Ser His Glu Asn Glu Arg He Phe Ala Met Ala 
465 470 475 



915 



963 



gac tac ttc ctg att gtt tgg gac ctg atg caa ttt gec cgc cag gaa 1011 
Asp Tyr Phe Leu He Val Trp Asp Leu Met Gin Phe Ala Arg Gin Glu 
320 325 330 



1059 



1107 



1155 



1203 



tac egg aag tat ggt cct gac cat gtg gee caa att ttg acc ttt ggg 1251 
Tyr Arg Lys Tyr Gly Pro Asp His Val Ala Gin He Leu Thr Phe Gly 
400 405 410 



1299 



tac aag aat gaa gac atg aag acc tgg tec cag gec ata cca gat acc 1347 
Tyr Lys Asn Glu Asp Met Lys Thr Trp Ser Gin Ala He Pro Asp Thr 

435 440 445 

gtc aac ate age ttg tea aag gec tat gac gag teg aaa gac ctt caa 13 95 

Val Asn He Ser Leu Ser Lys Ala Tyr Asp Glu Ser Lys Asp Leu Gin 

450 455 460 



1443 



cag gat ate gaa ggc ctg cca agg aac tat tea acc cat gcg gec ggt 1491 
Gin Asp He Glu Gly Leu Pro Arg Asn Tyr Ser Thr His Ala Ala Gly 
480 485 490 
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1539 



560 565 570 

ctt ttt gcc egg gga gac aca aat ggg gtc ttc caa ttt gaa aaa gag 
Leu Phe Ala Arg Gly Asp Thr Asn Gly Val Phe Gin Phe Glu Lys Glu 
575 580 585 590 

gga ate aaa aaa gtc etc cgc cag ctt caa ccc act tct ttt gaa gat 
Gly lie Lys Lys Val Leu Arg Gin Leu Gin Pro Thr Ser Phe Glu Asp 

595 600 605 

ate gtc gcc ace aac gcc etc tac cgc ccc ggt ccc atg ggg caa att 
lie Val Ala Thr Asn Ala Leu Tyr Arg Pro Gly Pro Met Gly Gin lie 

610 615 620 

gag aat tat att aac cgt aaa cat ggt caa gaa aaa att ate tac ccc 
Glu Asn Tyr lie Asn Arg Lys His Gly Gin Glu Lys lie lie Tyr Pro 
625 630 635 

cat gaa gac tta aag gac ate ctt gaa gtc act tat ggc att att gtc 
His Glu Asp Leu Lys Asp lie Leu Glu Val Thr Tyr Gly He He Val 
640 645 650 

tac cag gaa caa gtc atg cag gta get ace caa eta get ggc tat agt 
Tyr Gin Glu Gin Val Met Gin Val Ala Thr Gin Leu Ala Gly Tyr Ser 
655 660 665 670 

ttg teg gaa get gac caa ttg egg egg act atg tec aaa aaa ate cag 
Leu Ser Glu Ala Asp Gin Leu Arg Arg Thr Met Ser Lys Lys He Gin 

675 680 685 



aag ggc tac agt gag tea gta gcc cga gag gtt tat aac tat att gca 
Lys Gly Tyr Ser Glu Ser Val Ala Arg Glu Val Tyr Asn Tyr He Ala 
705 710 715 

aag ttt get aac tac ggc ttt aac cgt gcc cat get gtt gcc tac tec 



1587 



1635 



gtc gtc atg tea gac cag ccc eta ate cat tec ctt ccc eta caa gat 
Val Val Met Ser Asp Gin Pro Leu He His Ser Leu Pro Leu Gin Asp 
495 500 505 510 

ggc aac gga aag gtc ccc aac acc caa ttt ace atg gag gat gtt gaa 
Gly Asn Gly Lys Val Pro Asn Thr Gin Phe Thr Met Glu Asp Val Glu 

515 520 525 

gcg gtc ggc tta etc aag atg gac ttt ttg agt tta aaa aat tta acc 
Ala Val Gly Leu Leu Lys Met Asp Phe Leu Ser Leu Lys Asn Leu Thr 

530 535 540 

ate eta gca gac tgc ttg aac ttt age cag tat gaa ggg cag gga ggg 1683 
He Leu Ala Asp Cys Leu Asn Phe Ser Gin Tyr Glu Gly Gin Gly Gly 
545 550 555 

ggt ata agt aaa caa gat ata cca ate gac gac cct aag acc ctg gat 1731 
Gly He Ser Lys Gin Asp He Pro He Asp Asp Pro Lys Thr Leu Asp 



1779 



1827 



1875 



1923 



1971 



2019 



2067 



tea gaa atg gac cag gga egg gaa aaa ttt ata aga gga gcc ttg gac 2115 
Ser Glu Met Asp Gin Gly Arg Glu Lys Phe He Arg Gly Ala Leu Asp 

690 695 700 



2163 



2211 
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Lys Phe Ala Asn Tyr Gly Pile Asn Arg Ala His Ala Val Ala Tyr Ser 
720 725 730 

atg ctt gcc tac cat atg gcc tac ttt aag gtc cac cag cct aaa tct 

Met Leu Ala Tyr His Met Ala Tyr Phe Lys Val His Gin Pro Lys Ser 

7*^ 740 745 750 



cca gac ate aac caa age ctt gga tct ttt acg gtt egg cag aat ggc 
Pro Asp He Asn Gin Ser Leu Gly Ser Phe Thr Val Arg Gin Asn Gly 
785 790 795 



cgt gac ttt tgt gaa aaa att gac age caa ttc tta agt caa gac ccc 
Arg Asp Phe Cys Glu Lys He Asp Ser Gin Phe Leu Ser Gin Asp Pro 

835 840 845 

att gaa gca ttg att ttg gtg ggg gcc ttt gac caa atg ggc cct aat 
lie Glu Ala Leu He Leu Val Gly Ala Phe Asp Gin Met Gly Pro Asn 

850 855 860 

egg egg acc atg tta gcg ggc ttg gaa gca acg att gaa ttc gtg gcc 
Arg Arg Thr Met Leu Ala Gly Leu Glu Ala Thr He Glu Phe Val Ala 
865 870 875 

aaa agt teg ggc aat ate acc ctt ttt gac act etc aag ccc cgc caa 
Lys Ser Ser Gly Asn He Thr Leu Phe Asp Thr Leu Lys Pro Arg Gin 
880 885 890 

gaa gac ctg gaa gag ttt age cca aag gac etc att caa tat gaa gaa 
Glu Asp Leu Glu Glu Phe Ser Pro Lys Asp Leu He Gin Tyr Glu Glu 
895 900 905 910 

gaa tta acc ggt ttt tac ttc tec age cac ccc ttg age egg tat gac 
Glu Leu Thr Gly Phe Tyr Phe Ser Ser His Pro Leu Ser Arg Tyr Asp 

915 920 925 

tec ctg cga cag gac tta aaa acg tec ttt ata get gat tta gaa gag 
Ser Leu Arg Gin Asp Leu Lys Thr Ser Phe He Ala Asp Leu Glu Glu 

930 935 940 

ggc caa tct tgc caa gtt tta ggt cag ctg gtt caa gtc egg aaa act 
Gly Gin Ser Cys Gin Val Leu Gly Gin. Leu Val Gin Val Arg Lys Thr 



2259 



2307 



ttt ttt gcg get gtg atg aag gca gac tgg ggt aac aag get aaa att 
Phe Phe Ala Ala Val Met Lys Ala Asp Trp Gly Asn Lys Ala Lys He 

755 760 765 

tac aag tat gcc cat gaa gtc egg get aga aaa att aaa eta eta aaa 2355 
Tyr Lys Tyr Ala His Glu Val Arg Ala Arg Lys He Lys Leu Leu Lys 

770 775 780 



2403 



att caa gtg ggg ctt aag atg gtc aag ggg gtg get age ccc ttt gtc 2451 

He Gin Val Gly Leu Lys Met Val Lys Gly Val Ala Ser Pro Phe Val 
800 805 810 

aac cac ate ctt gaa att egg aaa gaa aag gga get ttt acc age ctg 

Asn His He Leu Glu He Arg Lys Glu Lys Gly Ala Phe Thr Ser Leu 
815 820 825 830 



2499 



2547 



2595 



2643 



2691 



2739 



2787 



2835 



2883 
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945 950 955 

cag act aga aac caa caa ccc atg gcc ttt gtt age ctg get gac caa 
Gin Thr Arg Asn Gin Gin Pro Met Ala Phe Val Ser Leu Ala Asp Gin 
960 965 970 

aca gga caa att age ctg gtg gtc ttt ccg aat gta tac cgc gaa tgc 
Thr Gly Gin lie Ser Leu Val Val Phe Pro Asn Val Tyr Arg Glu Cys 
975 980 985 990 

eta cct tac etc aaa gaa gga gtg gtc ctg gtc gtc tea ggc aag gta 
Leu Pro Tyr Leu Lys Glu Gly Val Val Leu Val Val Ser Gly Lys Val 

995 1000 1005 

gaa gtt agg aag gga gaa ate cag eta aaa gtc cag ace atg aaa gag 
Glu Val Arg Lys Gly Glu lie Gin Leu Lys Val Gin Thr Met Lys Glu 

1010 1015 1020 



gac ttg aac caa gat aaa gaa agt ttt cgt caa gtg caa aag ate ttg 
Asp Leu Asn Gin Asp Lys Glu Ser Phe Arg Gin Val Gin Lys lie Leu 
1040 1045 1050 

gcc cga cat ccc ggc cag aag cga gtg att gtt tac gac cag gcc age 
Ala Arg His Pro Gly Gin Lys Arg Val lie Val Tyr Asp Gin Ala Ser 
1055 1060 1065 1070 

cag caa gca etc cag etc aaa gca aaa ttt aat ttc gac gga egg acg 
Gin Gin Ala Leu Gin Leu Lys Ala Lys Phe Asn Phe Asp Gly Arg Thr 

1075 1080 1085 

gat ace eta aac cag etc cag gac etc eta ggc cag gat tct tgt ate 
Asp Thr Leu Asn Gin Leu Gin Asp Leu Leu Gly Gin Asp Ser Cys lie 

1090 1095 1100 

tta aaa taa 
Leu Lys. 



<210> 76 
<211> 1104 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 76 

Met Lys Gin lie Cys Leu Arg Arg Arg Gly Asp Lys Met Thr Phe Thr 
15 10 15 

His Leu Gin Val Thr Ser Ala Tyr Thr Leu Met Ala Ser Thr He Gin 

20 25 30 



2931 



2979 



3027 



3075 



gcc age cag gtc caa aaa gag act aag cag ctt tac ctg aaa ttt get 3123 
Ala Ser Gin Val Gin Lys Glu Thr Lys Gin Leu Tyr Leu Lys Phe Ala 
1025 1030 1035 



3171 



3219 



3267 



3315 



3324 



Leu Pro Leu Leu Met Asp Arg Leu Lys Glu Leu Gly Met Glu Ala Val 
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Ala Leu Thr Asp His Asn Val Met His Gly Ala Val Glu Phe Tyr Gin 
50 55 60 

Glu Ala Lys Lys His Gly He Lys Pro lie Met Gly Leu Arg Ala Asp 
65 70 75 80 

Leu Asp Glu Gly lie Thr Val Thr Leu Leu Ala Lys Asn Lys Ala Gly 

85 90 95 

Tyr Gin Ala Leu Leu Ala Leu Ser Thr Asp Leu Gin Val Asn Lys Gin 

100 105 HO 

Ala lie Thr Leu Asp Gin Val Arg Ser Val Ala Gin Asp Leu Tyr Thr 
115 120 125 

lie Phe Pro Ser Ser Asp Pro Lys Val Lys Ala Asp Leu Leu Asp Lys 
130 135 140 

Gin Ala Ser Asn Leu Thr Ala Met Thr Gin Asn Leu Pro His Ser Tyr 
145 150 155 160 

Leu Gly Leu Val Pro Asp Gin Asp Gin Lys lie Tyr Gin Leu Ala Arg 

165 170 175 

Thr Leu Ser Asp Ser Gly Gly Leu Lys Val Leu Ala Leu Ser Asp Val 

180 185 190 

Arg Cys Leu Glu Glu Ser Gin Val Ser Thr Leu Glu He Leu Ser His 
195 200 205 

He Lys Ala Asn Gin Lys He Gin Phe Asp Thr Gin Ala Arg Glu Asn 
210 215 220 

Tyr Ala Leu Arg Ser Pro Gin Glu Met Glu Ser Phe Phe Asn Gin Val 
225 230 235 240 

Gly Leu Gly Gin Ala Leu Lys Asn Thr Lys Asp Val Ala Gin Ser Val 

245 250 255 



Asp Trp Ser Leu Asp Leu Gly Gin Ala Lys Leu Pro Ala Phe Asp Leu 

260 265 270 
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Pro Glu Gly Glu Thr Lys Asp Ser Tyr Leu Gly Lys Leu Ala Gin Lys 
275 280 285 

Gly Leu Gin Glu Arg Val Pro Gly Tyr Gly Gin Asp Tyr Gin Asp Arg 
290 295 300 

Leu Asp Lys Glu Leu Ala Val lie Ser Ser Met Gly Phe Ser Asp Tyr 
305 310 315 320 

Phe Leu lie Val Trp Asp Leu Met Gin Phe Ala Arg Gin Glu Lys lie 

325 330 335 



Glu Thr Gly Phe Gly Arg Gly Ser Ala Ala Ala Ser Leu Val Ser Tyr 

340 345 . 350 

Ala Leu Tyr lie Thr Gly Val Asp Pro lie His Tyr Asp Leu Leu Phe 
355 360 365 

Glu Arg Phe Leu Asn Lys Asp Arg Phe Thr Met Pro Asp lie Asp Leu 
370 375 380 

Asp Phe Pro Asp Asn Lys Arg Gin Val He Leu Asp Tyr Val Tyr Arg 
385 390 395 400 

Lys Tyr Gly Pro Asp His Val Ala Gin He Leu Thr Phe Gly Thr Phe 

405 410 415 



Ala Ala Lys Ser Ser He Arg Glu He Met Arg Thr Leu Gly Tyr Lys 

420 425 430 

Asn Glu Asp Met Lys Thr Trp Ser Gin Ala He Pro Asp Thr Val Asn 
435 440 445 

He Ser Leu Ser Lys Ala Tyr Asp Glu Ser Lys Asp Leu Gin Lys Leu 
450 455 460 

Val Gin Gin Ser His Glu Asn Glu Arg He Phe Ala Met Ala Gin Asp 
465 470 475 480 

He Glu Gly Leu Pro Arg Asn Tyr Ser Thr His Ala Ala Gly Val Val 

485 490 495 
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Met Ser Asp Gin Pro Leu lie His Ser Leu Pro Leu Gin Asp Gly Asn 

500 505 510 

Gly Lys Val Pro Asn Thr Gin Phe Thr Met Glu Asp Val Glu Ala Val 
515 520 525 

Gly Leu Leu Lys Met Asp Phe Leu Ser Leu Lys Asn Leu Thr He Leu 
530 535 540 

Ala Asp Cys Leu Asn Phe Ser Gin Tyr Glu Gly Gin Gly Gly Gly He 
545 550 555 560 

Ser Lys Gin Asp He Pro He Asp Asp Pro Lys Thr Leu Asp Leu Phe 

565 570 575 

Ala Arg Gly Asp Thr Asn Gly Val Phe Gin Phe Glu Lys Glu Gly He 

580 585 590 

Lys Lys Val Leu Arg Gin Leu Gin Pro Thr Ser Phe Glu Asp He Val 
595 600 605 

Ala Thr Asn Ala Leu Tyr Arg Pro Gly Pro Met Gly Gin He Glu Asn 
610 615 620 

Tyr He Asn Arg Lys His Gly Gin Glu Lys He He Tyr Pro His Glu 
625 630 635 640 

Asp Leu Lys Asp He Leu Glu Val Thr Tyr Gly He He Val Tyr Gin 

645 650 655 

Glu Gin Val Met Gin Val Ala Thr Gin Leu Ala Gly Tyr Ser Leu Ser 

660 665 670 

Glu Ala Asp Gin Leu Arg Arg Thr Met Ser Lys Lys He Gin Ser Glu 
675 680 685 

Met Asp Gin Gly Arg Glu Lys Phe lie Arg Gly Ala Leu Asp Lys Gly 
■ 690 695 700 

Tyr Ser Glu Ser Val Ala Arg Glu Val Tyr Asn Tyr He Ala Lys Phe 
705 710 715 720 
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Ala Asn Tyr Gly Phe Asn Arg Ala His Ala Val Ala Tyr Ser Met Leu 

725 730 735 

Ala Tyr His Met Ala Tyr Phe Lys Val His Gin Pro Lys Ser Phe Phe 

740 745 750 

Ala Ala Val Met Lys Ala Asp Trp Gly Asn Lys Ala Lys lie Tyr Lys 
755 760 765 

Tyr Ala His Glu Val Arg Ala Arg Lys He Lys Leu Leu Lys Pro Asp 
770 775 780 

He Asn Gin Ser Leu Gly Ser Phe Thr Val Arg Gin Asn Gly He Gin 
785 790 795 800 

Val Gly Leu Lys Met Val Lys Gly Val Ala Ser Pro Phe Val Asn His 

805 810 815 

He Leu Glu He Arg Lys Glu Lys Gly Ala Phe Thr Ser Leu Arg Asp 

820 825 830 

Phe Cys Glu Lys He Asp Ser Gin Phe Leu Ser Gin Asp Pro He Glu 
835 840 845 

Ala Leu He Leu Val Gly Ala Phe Asp Gin Met Gly Pro Asn Arg Arg 
850 855 860 

Thr Met Leu Ala Gly Leu Glu Ala Thr He Glu Phe Val Ala Lys Ser 
865 870 875 880 

Ser Gly Asn He Thr Leu Phe Asp Thr Leu Lys Pro Arg Gin Glu Asp 

885 890 895 

Leu Glu Glu Phe Ser Pro Lys Asp Leu He Gin Tyr Glu Glu Glu Leu 

900 905 910 

Thr Gly Phe Tyr Phe Ser Ser His Pro Leu Ser Arg Tyr Asp Ser Leu 
915 920 925 

Arg Gin Asp Leu Lys Thr Ser Phe He Ala Asp Leu Glu Glu Gly Gin 
930 " 935 940 

Ser Cys Gin Val Leu Gly Gin Leu Val Gin Val Arg Lys Thr Gin Thr 
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945 



950 955 960 



Arg Asn Gin Gin Pro Met Ala Phe Val Ser Leu Ala Asp Gin Thr Gly 

965 970 975 

Gin He Ser Leu Val Val Phe Pro Asn Val Tyr Arg Glu Cys Leu Pro 

980 985 990 

Tyr Leu Lys Glu Gly Val Val Leu Val Val Ser Gly Lys Val Glu Val 
995 1000 1005 

Arg Lys Gly Glu He Gin Leu Lys Val Gin Thr Met Lys Glu Ala Ser 
1010 1015 1020 

Gin Val Gin Lys Glu Thr Lys Gin Leu Tyr Leu Lys Phe Ala Asp Leu 
1025 1030 1035 1040 

Asn Gin Asp Lys Glu Ser Phe Arg Gin Val Gin Lys He Leu Ala Arg 

1045 1050 1055 

His Pro Gly Gin Lys Arg Val He Val Tyr Asp Gin Ala Ser Gin Gin 

1060 1065 1070 

Ala Leu Gin Leu Lys Ala Lys Phe Asn Phe Asp Gly Arg Thr Asp Thr 
1075 1080 1085 

Leu Asn Gin Leu Gin Asp Leu Leu Gly Gin Asp Ser Cys He Leu Lys 
1090 1095 HOO 



<210> 77 
<211> .1212 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (1212) 

<223> 

<400> 77 

acaaag atg ctg aaa aat aaa aag ata gcc tta tat gtt act ggt ggt 

Met Leu Lys Asn Lys Lys He Ala Leu Tyr Val Thr Gly Gly 
15 10 



48 
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ata gca gta tac aaa tea ctt tac tta ctt agg gaa ate ate aaa caa 
He Ala Val Tyr Lys Ser Leu Tyr Leu Leu Arg Glu He He Lys Gin 
15 20 25 30 

ggc ggg gag gtc egg gtt gee atg act caa gca get tgt caa ttt gtt 
Gly Gly Glu Val Arg Val Ala Met Thr Gin Ala Ala Cys Gin Phe Val 

3 5 40 45 

aac ccc tta tct ttt cag gtt tta age caa aaa aag gtt cag att gac 
Asn Pro Leu Ser Phe Gin Val Leu Ser Gin Lys Lys Val Gin He Asp 

50 55 60 

act ttt gaa gaa ggt cag ccc gaa teg gtc agt cac att gat ttg acg 
Thr Phe Glu Glu Gly Gin Pro Glu Ser Val Ser His He Asp Leu Thr 
65 70 75 

gat tgg gee gac tac tec ate gtg get ccg gca act gee aat ate ate 
Asp Trp Ala Asp Tyr Ser He Val Ala Pro Ala Thr Ala Asn He He 
80 85 90 



96 



ttg gca acg gac cac ccc att ttt tta gtc cca gee atg aac acc aag 
Leu Ala Thr Asp His Pro He Phe Leu Val Pro Ala Met Asn Thr Lys 

115 120 125 

atg tat gaa aat ccc get ctt aag aaa aac aag gee ttc ctt att gaa 
Met Tyr Glu Asn Pro Ala Leu Lys Lys Asn Lys Ala Phe Leu He Glu 

130 135 140 

cag ggc cat tac tgg atg gag ccg gat att gga ttt tta gca gag ggc 
Gin Gly His Tyr Trp Met Glu Pro Asp He Gly Phe Leu Ala Glu Gly 
145 150 155 

tac gaa ggc ttg ggt cgt ttt cca gac eta gac egg att atg gcg gaa 
Tyr Glu Gly Leu Gly Arg Phe Pro Asp Leu Asp Arg He Met Ala Glu 
160 " 165 170 



aaa gtc etc gtc aca gca ggt ggg acg gtg gag egg att gat ccc gtc 
Lys Val Leu Val Thr Ala Gly Gly Thr Val Glu Arg He Asp Pro Val 

195 200 205 



caa gcg gec tat gaa get ggg gee cag gtt age ttg gta aca gee. agt 
Gin Ala Ala Tyr Glu Ala Gly Ala Gin Val Ser Leu Val Thr Ala Ser 
225 "* 230 235 



144 



192 



240 



288 



ggc aag ctg gee aat ggg att ggg gac gat ttt gtt tea aca gee ttg 336 
Gly Lys Leu Ala Asn Gly He Gly Asp Asp Phe Val Ser Thr Ala Leu 
95 " 100 105 110 



384 



432 



480 



528 



ttt aac cat ttt att att get agg aat cca ggt ate eta tea gga aaa 57 6 

Phe Asn His Phe He He Ala Arg Asn Pro Gly He Leu Ser Gly Lys 
175 180 185 190 



624 



egg tat att tec aat gat tct tct ggt aag atg ggc cac caa ctt get 672 
Arg Tyr He Ser Asn Asp Ser Ser Gly Lys Met Gly His Gin Leu Ala 

210 215 220 



720 



gac ttg ccg acc agt ccc ttt att gac cgc ttt cag gtg gag tec acc 



768 
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Asp Leu Pro Thr Ser Pro Phe lie Asp Arg Phe Gin Val Glu Ser Thr 
240 245 250 

tta gac ttg tac caa aca gtt agt gac etc tat gac cac cat gac att 
Leu Asp Leu Tyr Gin Thr Val Ser Asp Leu Tyr Asp His His Asp He 
255 260 265 270 

etc atg atg gec gca gcg gtg tct gac tac egg cca gtc aac egg tea 
Leu Met Met Ala Ala Ala Val Ser Asp Tyr Arg Pro Val Asn Arg Ser 

275 280 285 

gac aaa aag atg aaa aag caa gat aat tta ace att gaa ctg gaa aaa 
Asp Lys Lys Met Lys Lys Gin Asp Asn Leu Thr He Glu Leu Glu Lys 

290 295 300 

aat cct gat att ttg gee gaa atg ggc egg egg aaa gac caa caa ate 
Asn Pro Asp He Leu Ala Glu Met Gly Arg Arg Lys Asp Gin Gin He 
305 310 315 

aat gtc ggc ttt gca gca gaa ace cat aac ctt gaa gaa tat gee caa 
Asn Val Gly Phe Ala Ala Glu Thr His Asn Leu Glu Glu Tyr Ala Gin 
320 325 330 

aaa aaa tta gee tec aaa caa get gac ttg ate gta gee aat gaa gtg 
Lys Lys Leu Ala Ser Lys Gin Ala Asp Leu He Val Ala Asn Glu Val 
335 " 340 345 350 

ggc egg gga gac egg ggc ttt aat gcg gat gaa aat gcg gee ctt gtt 
Gly Arg Gly Asp Arg Gly Phe Asn Ala Asp Glu Asn Ala Ala Leu Val 

355 360 365 



gat atg gca aaa aag att att gaa gtg gtg gec agt aaa ttg cct get 
Asp Met Ala Lys Lys He He Glu Val Val Ala Ser Lys Leu Pro Ala 
385 ~ 390 395 



tct ccc aaa taa 
Ser Pro Lys 
400 



816 



864 



912 



960 



1008 



1056 



1104 



ttt tec agt gac caa gat ccg ctt gag ctt ccc ctt cag tct aaa aaa 1152 
Phe Ser Ser Asp Gin Asp Pro Leu Glu Leu Pro Leu Gin Ser Lys Lys 

370 375 380 



1200 



1212 



<210> 78 
<211> 401 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 78 

Met Leu Lys Asn Lys Lys He Ala Leu Tyr Val Thr Gly Gly He Ala 
1 ~ 5 10 15 



Val Tyr Lys Ser Leu Tyr Leu Leu Arg Glu He He Lys Gin Gly Gly 

20 25 30 
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Glu Val Arg Val Ala Met Thr Gin Ala Ala Cys Gin Phe Val Asn Pro 
35 40 45 

Leu Ser Phe Gin Val Leu Ser Gin Lys Lys Val Gin He Asp Thr Phe 
50 55 60 

Glu Glu Gly Gin Pro Glu Ser Val Ser His He Asp Leu Thr Asp Trp 
65 70 75 80 

Ala Asp Tyr Ser He Val Ala Pro Ala Thr Ala Asn He He Gly Lys 
" ^ 85 90 95 

Leu Ala Asn Gly He Gly Asp Asp Phe Val Ser Thr Ala Leu Leu Ala 

100 105 HO 

Thr Asp His Pro He Phe Leu Val Pro Ala Met Asn Thr Lys Met Tyr 
115 120 125 

Glu Asn Pro Ala Leu Lys Lys Asn Lys Ala Phe Leu He Glu Gin Gly 
130 135 140 

His Tyr Trp Met Glu Pro Asp He Gly Phe Leu Ala Glu Gly Tyr Glu 
145 150 155 160 

Gly Leu Gly Arg Phe Pro Asp Leu Asp Arg He Met Ala Glu Phe Asn 

165 170 175 

His Phe He He Ala Arg Asn Pro Gly He Leu Ser Gly Lys Lys Val 

180 185 190 

Leu Val Thr Ala Gly Gly Thr Val Glu Arg He Asp Pro Val Arg Tyr 
195 200 205 

He Ser Asn Asp Ser Ser Gly Lys Met Gly His Gin Leu Ala Gin Ala 
210 215 220 

Ala Tyr Glu Ala Gly Ala Gin Val Ser Leu Val Thr Ala Ser Asp Leu 
225 230 235 240 

Pro Thr Ser Pro Phe He Asp Arg Phe Gin Val Glu Ser Thr Leu Asp 

245 250 255 
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Leu Tyr Gin Thr Val Ser Asp Leu Tyr Asp His His Asp lie Leu Met 

260 265 270 

Met Ala Ala Ala Val Ser Asp Tyr Arg Pro Val Asn Arg Ser Asp Lys 
275 280 285 



Lys Met Lys Lys Gin Asp Asn Leu Thr He Glu Leu Glu Lys Asn Pro 
290 295 300 

Asp He Leu Ala Glu Met Gly Arg Arg Lys Asp Gin Gin He Asn Val 
305 310 315 



Glv Phe Ala Ala Glu Thr His Asn Leu Glu Glu Tyr Ala Gin Lys Lys 

325 330 335 

Leu Ala Ser Lys Gin Ala Asp Leu He Val Ala Asn Glu Val Gly Arg 

340 345 350 

Glv Asp Arg Gly Phe Asn Ala Asp Glu Asn Ala Ala Leu Val Phe Ser 
355 360 365 



Ser Asp Gin Asp Pro Leu Glu Leu .Pro Leu Gin Ser Lys Lys Asp Met 
370 375 380 

Ala Lys Lys He He Glu Val Val Ala Ser Lys Leu Pro Ala Ser Pro 
385 390 395 400 



Lys 



<210> 79 
<211> 1053 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (22) . . (1053) 

<223> 



<400> 79 n 
aagaagaagg gaggaagact g atg aaa att gaa gac caa etc aaa aaa att 

Met Lys lie Glu Asp Gin Leu Lys Lys lie 
1 5 1° 

aaa gac caa gac ttg tct ccc etc tac ctg gtc cag gga gat gac cag 99 
Lys Asp Gin Asp Leu Ser Pro Leu Tyr Leu Val Gin Gly Asp Asp Gin 

15 20 25 
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etc caa aac cca gec gac ttt act gtt etc gtc ttc ttt gec ccc tat 
Leu Gin Asn Pro Ala Asp Phe Thr Val Leu Val Phe Phe Ala Pro Tyr 

HO 115 120 

gag aaa ctg gac aag egg aag aag gtc acc aaa gec eta ttg cag gaa 
Glu Lys Leu Asp Lys Arg Lys Lys Val Thr Lys Ala Leu Leu Gin Glu 
125 130 135 

get gag att ata gat gec agt tec cca gac caa aga gat eta aaa gat 
Ala Glu lie He Asp Ala Ser Ser Pro Asp Gin Arg Asp Leu Lys Asp 
140 145 150 



get tta aag gec ctg gtt gaa aaa acc aat gee aac tta agt egg gtc 
Ala Leu Lys Ala Leu Val Glu Lys Thr Asn Ala Asn Leu Ser Arg Val 

175 180 185 

atg caa gag ttg gac aag tta ttc ttg tac cat tta gat gac aaa ate 
Met Gin Glu Leu Asp Lys Leu Phe Leu Tyr His Leu Asp Asp Lys He 

190 195 200 

ate acc gtc cag tea gtt gac cag gtc gta tea cca age ctg gaa agt 
He Thr Val Gin Ser Val Asp Gin Val Val Ser Pro Ser Leu Glu Ser 
205 210 215 

aat gtc ttt agt att aac gac tat att tta age ggg caa age cag get 
Asn Val Phe Ser He Asn Asp Tyr He Leu Ser Gly Gin Ser Gin Ala 
220 225 230 

get ata egg gec ttt aat gac tta att caa caa aag gaa gag cca att 
Ala He Arg Ala Phe Asn Asp Leu He Gin Gin Lys Glu Glu Pro He 
235 240 245 250 



195 



243 



tac ttg tta gac cag gtt aaa aaa agt ttg age cag gee ctt ttg gac 147 
Tyr Leu Leu Asp Gin Val Lys Lys Ser Leu Ser Gin Ala Leu Leu Asp 

30 35 40 

cag gat gaa get tct atg aat ttt ggt caa ttt aat atg atg get gat 
Gin Asp Glu Ala Ser Met Asn Phe Gly Gin Phe Asn Met Met Ala Asp 
45 50 55 

age eta gac atg gee ttg tct gat gcg gaa tec tat ccc ttt ttt ggg 
Ser Leu Asp Met Ala Leu Ser Asp Ala Glu Ser Tyr Pro Phe Phe Gly 
60 65 70 

gac aag cgc ctg gtt tac ate caa gac ccc ttt ttc eta aca ggg gag 
Asp Lys Arg Leu Val Tyr He Gin Asp Pro Phe Phe Leu Thr Gly Glu 
75 80 85 90 

aag egg aaa aca gat ctg gac cat gac ttg gat cgc ttg ctg get tac 339 
Lys Arg Lys Thr Asp Leu Asp His Asp Leu Asp Arg Leu Leu Ala Tyr 

95 100 105 



291 



387 



435 



483 



atg gtc cag aaa aaa gta aag get cga ggc tac cag ttt gac aaa gga 531 
Met Val Gin Lys Lys Val Lys Ala Arg Gly Tyr Gin Phe Asp Lys Gly 
155 160 165 170 



579 



627 



675 



723 



771 
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aaa ate ate gee att atg atg aac caa ttc cgt tta tta ttg cag gtt 
Lys He He Ala He Met Met Asn Gin Phe Arg Leu Leu Leu Gin Val 

255 260 265 

aaa ata ttg egg act aag ggc tac caa caa gga gag ate get aaa ate 
Lys He Leu Arg Thr Lys Gly Tyr Gin Gin Gly Glu He Ala Lys He 

270 275 280 

tta aaa gtt cac ccc tac egg gtt aag eta gee ata gag aaa cag gag 
Leu Lys Val His Pro Tyr Arg Val Lys Leu Ala He Glu Lys Gin Glu 
285 290 295 

att ttt tec aag caa agt eta teg acc gee tac cgc tac tta att gag 
lie Phe Ser Lys Gin Ser Leu Ser Thr Ala Tyr Arg Tyr Leu He Glu 
300 305 310 

tea gat cat ttg att aaa acg ggc aag gtg acc teg caa ttg caa ttt 
Ser Asp His Leu He Lys Thr Gly Lys Val Thr Ser Gin Leu Gin Phe 
315 320 325 330 

gaa ctt ttt gee eta caa ttt aaa gat tct gtc atg aat taa 
Glu Leu Phe Ala Leu Gin Phe Lys Asp Ser Val Met Asn 

335 340 



<210> 80 
<211> 343 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 80 

Met Lys He Glu Asp Gin Leu Lys Lys He Lys Asp Gin Asp Leu Ser 

^ 10 15 



Pro Leu Tyr Leu Val Gin Gly Asp Asp Gin Tyr Leu Leu Asp Gin Val 

20 25 30 

Lys Lys Ser Leu Ser Gin Ala Leu Leu Asp Gin Asp Glu Ala Ser Met 
35 40 45 

Asn Phe Gly Gin Phe Asn Met Met Ala Asp Ser Leu Asp Met Ala Leu 
50 55 60 

Ser Asp Ala Glu Ser Tyr Pro Phe Phe Gly Asp Lys Arg Leu Val Tyr 
65 70 75 80 

He Gin Asp Pro Phe Phe Leu Thr Gly Glu Lys Arg Lys Thr Asp Leu 

85 90 95 

Asp His Asp Leu Asp Arg Leu Leu Ala Tyr Leu Gin Asn Pro Ala Asp 

100 1°5 HO 



819 



867 



915 



963 



1011 



1053 
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Phe Thr Val Leu Val Phe Phe Ala Pro Tyr Glu Lys Leu Asp Lys Arg 
115 120 125 

Lvs Lys Val Thr Lys Ala Leu Leu Gin Glu Ala Glu He He Asp Ala 
130 135 140 

Ser Ser Pro Asp Gin Arg Asp Leu Lys Asp Met Val Gin Lys Lys Val 
145 150 155 160 

Lys Ala Arg Gly Tyr Gin Phe Asp Lys Gly Ala Leu Lys Ala Leu Val 
" 165 170 175 

Glu Lys Thr Asn Ala Asn Leu Ser Arg Val Met Gin Glu Leu Asp Lys 

180 185 1^0 

Leu Phe Leu Tyr His Leu Asp Asp Lys He He Thr Val Gin Ser Val 
195 200 205 

Asp Gin Val Val Ser Pro Ser Leu Glu Ser Asn Val Phe Ser He Asn 
210 215 220 

Asp Tyr He Leu Ser Gly Gin Ser Gin Ala Ala He Arg Ala Phe Asn 
225 230 235 240 

Asp Leu He Gin Gin Lys Glu Glu Pro He Lys He He Ala lie Met 

245 250 255 

Met Asn Gin Phe Arg Leu Leu Leu Gin Val Lys He Leu Arg Thr Lys 

260 265 270 

Gly Tyr Gin Gin Gly Glu He Ala Lys He Leu Lys Val His Pro Tyr 
275 280 285 

Arg Val Lys Leu Ala He Glu Lys Gin Glu He Phe Ser Lys Gin Ser 
290 295 300 

Leu Ser Thr Ala Tyr Arg Tyr Leu He Glu Ser Asp His Leu He Lys 
305 310 315 320 

Thr Gly Lys Val Thr Ser Gin Leu Gin Phe Glu Leu Phe Ala Leu Gin 

325 330 335 
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Phe Lys Asp Ser Val Met Asn 

340 



<210> 81 
<211> 477 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (1) . . (477) 

<223> 



^""cgc gca ate tat gca ggc agt ttt gat ccg att acc ctg ggc 48 
Met Asn Arg La He Tyr Ala Gly Ser Phe Asp Pro He Thr Leu Gly 



1 5 



cac ctg gat ate att aaa agg gec age cac tta ttc gat gaa gtc ate 
His Leu Asp lie He Lys Arg Ala Ser His Leu Phe Asp Glu Val He 

20 25 30 

att gca gtt get aat aat aca teg aaa aat agt atg ttg aac ttt gac 
Val Ala Val Ala Asn Asn Thr Ser Lys Asn Ser Met Leu Asn Phe Asp 
35 40 45 

caa aaa ttg aac ctg gtt gaa caa tea att get age cag ggt eta get 
Gin Lys Leu Asn Leu Val Glu Gin Ser He Ala Ser Gin Gly Leu Ala 
50 55 60 



aat gtt caa gee aag aca tta gag tea ggc ttg att gtt gac ttt get 
Asn Val Gin Ala Lys Thr Leu Glu Ser Gly Leu He Val Asp Phe Ala 
65 70 75 80 

aag gac caa gga get agt agt ctg gtt agg ggg ttg egg teg gtt aaa 
Lys Asp Gin Gly Ala Ser Ser Leu Val Arg Gly Leu Arg Ser Val Lys 

85 90 95 

gac ttt gaa tat gag att gee att gag gac tta aat aag gtc caa gac 
Asp Phe Glu lyr Glu He Ala He Glu Asp Leu Asn Lys Val Gin Asp 

100 105 HO 

cca get att gaa aca gtt tac eta gtc teg tct tee aaa tac egg tec 
Pro Ala He Glu Thr Val Tyr Leu Val Ser Ser Ser Lys Tyr Arg Ser 
115 120 125 

att tct tec tct att gtt egg gaa att att aag ttt aat ggc egg ett 
He Ser Ser Ser He Val Arg Glu He He Lys Phe Asn Gly Arg Leu 
130 135 140 

gat gac eta gta cct gac ccc gtc gtc gaa tat ttt aaa aaa taa 
Asp Asp Leu Val Pro Asp Pro Val Val Glu Tyr Phe Lys Lys 
145 ~ 150 155 



96 



144 



192 



240 



288 



336 



384 



432 



477 
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<210> 82 
<211> 158 
<212> PRT 

<213> Alloiococcus otitidis 

Me?°Ln 2 Arg Ala He Tyr Ala Gly Ser Phe Asp Pro He Thr Leu Gly 

C 10 



His Leu Asp He He Lys Arg Ala Ser His Leu Phe Asp Glu Val He 

2 0 

Ala Val Ala Asn Asn Thr Ser Lys Asn Ser Met Leu Asn Phe Asp 



Val 

35 



40 45 



Gin Lys Leu Asn Leu Val Glu Gin Ser He Ala Ser Gin Gly Leu Ala 
50 



55 6° 



Asn Val Gin Ala Lys Thr Leu Glu Ser Gly Leu lie Val Asp Phe Ala 
65 70 75 



L ys Asp Gin Gly Ala Ser Ser Leu Val Arg Gly Leu Arg Ser Val Lys 

85 ~ 



Asp Phe Glu Tyr Glu He Ala He Glu Asp Leu Asn Lys Val Gin Asp 



100 



105 



Pro Ala He Glu Thx Val Tyr Leu Val Ser Ser Ser Lys Tyr Arg Ser 
115 I 20 125 

He Ser Ser. Ser He Val Arg Glu He He Lys Phe Asn Gly Arg Leu 
130 135 140 

Asp Asp Leu Val Pro Asp Pro Val Val Glu Tyr Phe Lys Lys 
14 5 150 155 



<210> 83 
<211> 1260 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (28) . . (1260) 

<223> 



<400> 83 
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ataaggattt caggaggaat actcata atg gat ttc aac tta gat aat aca gtt 

Met Asp Phe Asn Leu Asp Asn Thx Val 
1 5 

tea ggt ggc gca aag att aag gtt att ggt gtt ggc ggt get ggt ggc 
Ser Gly Gly Ala Lys He Lys Val He Gly Val Gly Gly Ala Gly Gly 
10 15 20 25 

aat gec gtt aac egg atg att gaa gat gga gtc gaa ggc gtt gaa ttt 
Asn Ala Val Asn Arg Met He Glu Asp Gly Val Glu Gly Val Glu Phe 

30 35 40 

att gta gec aat aca gat gtc caa gec ctt gat gec aac cga get gag 
He Val Ala Asn Thr Asp Val Gin Ala Leu Asp Ala Asn Arg Ala Glu 

45 50 55 

act aaa att caa etc gga gag aag tta acc agg gga etc ggt gec gga 
Thr Lys He Gin Leu Gly Glu Lys Leu Thr Arg Gly Leu Gly Ala Gly 
60 65 70 

get aat cca gaa gtt ggc cgt aag teg get gaa gag agt gaa gaa acc 
Ala Asn Pro Glu Val Gly Arg Lys Ser Ala Glu Glu Ser Glu Glu Thr 
75 80 85 

att gee gaa get ctt gaa gga get gac atg gtc ttc gtt act get ggt 
He Ala Glu Ala Leu Glu Gly Ala Asp Met Val Phe Val Thr Ala Gly 
90 95 100 105 

atg ggt ggc ggt act ggt act ggc ggg gcg ggc att att gee cgc att 
Met Gly Gly Gly Thr Gly Thr Gly Gly Ala Gly He He Ala Arg He 

110 115 120 

gec aaa gaa caa ggg get ttg act gta ggg gtt att acc egg ccg ttc 
Ala Lys Glu Gin Gly Ala Leu Thr Val Gly Val He Thr Arg Pro Phe 

125 130 135 

act ttt gaa gga cca aaa cgt ggg cgc ttt gca gee gaa ggg att gee 
Thr Phe Glu Gly Pro Lys Arg Gly Arg Phe Ala Ala Glu Gly He Ala 
140 145 150 

caa atg egg gaa cat gtt gac acc ctt gtc acc ate tec aac aac cgc 
Gin Met Arg Glu His Val Asp Thr Leu Val Thr He Ser Asn Asn Arg 
155 160 165 

ttg eta gaa att gtg gac aag aaa aca ccg atg atg gaa gee ttc aga 
Leu Leu Glu He Val Asp Lys Lys Thr Pro Met Met Glu Ala Phe Arg 
170 175 180 185 

gaa gca gat aat gtc etc cgc caa ggg gtt caa ggt ata tct gac ttg 
Glu Ala Asp Asn Val Leu Arg Gin Gly Val Gin Gly He Ser Asp Leu 

190 195 200 

att acc aat cca ggc tac gtc aac tta gac ttt gee gat gtc aaa acg 
He Thr Asn Pro Gly Tyr Val Asn Leu Asp Phe Ala Asp Val Lys Thr 

205 210 215 



54 



102 



150 



198 



246 



294 



342 



390 



438 



486 



534 



582 



630 



678 



gtg atg gec aac caa ggt tct gec ttg atg ggg att ggg tct get tea 726 
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val Met Ala Asn Gin Gly Ser Ala Leu Met Gly He Gly Ser Ala Ser 



220 225 

ggt gag aat aga acg get gaa get act aag aaa get att tea tet cea 
Sly Glu Asn Arg Thr Ala Glu Ala Thr Lys Lys Ala He Ser Ser Pro 
" 235 240 245 



ctt ttg gaa gtc tec etc aat ggg get gaa aat gtc eta tta aac ata 
Leu Leu Glu Val Ser Leu Asn Gly Ala Glu Asn Val Leu Leu Asn lie 
250 ^ 

arc aaa aac caa gac tta acc etc ttt gaa get caa gat get tet gat 

ace gga aac y« _ ^ m 3 r^i -n as™ Ma Ser Asp 

Asp 

270 

rrrrrr rrr^f act oct tet agt gat gtt aat att ate ttc ggt act 
ate gtc ggg get get get tec ggt g ^ ^ T i Q Tlea T hr 

Ala 

285 



aor- naa aac caa gac tta acc cue ctt- ycaa y^«- ^ — - 

£ G?y Asn Gin Lp Leu Thr Leu Phe Glu Ala Gin Asp Ala Ser Asp 

ate gtc ggg get get get tet ggt gac gtt aat att ate ttc ggt 
Se Val G?y Ala Ala Ala Ser Gly Asp Val Asn He He Phe Gly Thr 
"~ *" 290 295 ' 



tec ate aat gaa gac ctg gaa gat gag gtc ate gtt ace gtt att gca 
Ser He Asn Glu Asp Leu Glu Asp Glu Val He Val Thr Val He Ala 



300 



305 310 



act ggt ate act ggt aaa gac atg ggc gag aaa tet tet aaa tec tea 
Thr Gly He Thr Gly Lys Asp Met Gly Glu Lys Ser Ser Lys Ser Ser 
315 320 325 

aac cgt age caa ggt eet agt caa aaa agt caa get ega tea get agt 
Asn Arg Ser Gin Gly Pro Ser Gin Lys Ser Gin Ala Arg Ser Ala Ser 
330 335 340 

gag tet age ttc tet age tgg caa aac caa tee aat gaa aga cea ggg 
Glu Ser Ser Phe Ser Ser Trp Gin Asn Gin Ser Asn Glu Arg Pro Gly 

350 355 360 

gaa gac caa gac ega cea age tet caa aga egg gaa gtc gat egg tec 
Glu Lp Gin Asp Arg Pro Ser Ser Gin Arg Arg Glu Val Asp Arg Ser 

365 370 375 

gaa aac ctg ttc aat gac gat agt aag gac cag cea gca gac tet ggt 
Glu Asn Leu Phe Asn Asp Asp Ser Lys Asp Gin Pro Ala Asp Ser Gly 
380 385 390 

gat gat gac gaa ttg gat acc cet eet ttc ttt aga cgt cgc cgc aag 
Asp Asp Asp Glu Leu Asp Thr Pro Pro Phe Phe Arg Arg Arg Arg Lys 
395 400 405 



aat tag 

Asn 

410 



774 



822 



870 



918 



966 



1014 



1062 



1110 



1158 



1206 



1254 



1260 



<210> 84 
<211> 410 
<212> PRT 

<213> Alloiococcus otitidis 
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<400> 84 _ t 

Met Asp Phe Asn Leu Asp Asn Thr Val Ser Gly Gly Ala Lys lie Lys 
15 10 15 

Val He Gly Val Gly Gly Ala Gly Gly Asn Ala Val Asn Arg Met He 

20 25 30 

Glu Asp Gly Val Glu Gly Val Glu Phe He Val Ala Asn Thr Asp Val 
35 40 45 

Gin Ala Leu Asp Ala Asn Arg Ala Glu Thr Lys He Gin Leu Gly Glu 
50 55 60 

Lys Leu Thr Arg Gly Leu Gly Ala Gly Ala Asn Pro Glu Val Gly Arg 
65 70 75 80 

Lys Ser Ala Glu Glu Ser Glu Glu Thr He Ala Glu Ala Leu Glu Gly 

85 90 95 

Ala Asp Met Val Phe Val Thr Ala Gltf Met Gly Gly Gly Thr Gly Thr 

100 105 HO 

Gly Gly Ala Gly He He Ala Arg He Ala Lys Glu Gin Gly Ala Leu 
115 120 125 

Thr Val Gly Val He Thr Arg Pro Phe Thr Phe Glu Gly Pro Lys Arg 
130 135 140 

Gly Arg Phe Ala Ala Glu Gly He Ala Gin Met Arg Glu His Val Asp 
145 150 155 160 

Thr Leu Val Thr He Ser Asn Asn Arg Leu Leu Glu He Val Asp Lys 

165 170 175 

Lys Thr Pro Met Met Glu Ala Phe Arg Glu Ala Asp Asn Val Leu Arg 

180 185 190 

« 

Gin Gly Val Gin Gly He Ser Asp Leu He Thr Asn Pro Gly Tyr Val 
195 200 205 

Asn Leu Asp Phe Ala Asp Val Lys Thr Val Met Ala Asn Gin Gly Ser 
210 215 220 
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Ala Leu Met Gly He Gly Ser Ala Ser Gly Glu Asn Arg Thr Ala Glu 



225 



230 



235 



Ala Thr Lys Lys Ala He Ser Ser Pro Leu Leu Glu Val Ser Leu Asn 

250 



245 



Gly Ala Glu Asn Val Leu Leu Asn He Thr Gly Asn Gin Asp Leu Thr 

260 265 270 



Leu Phe Glu Ala Gin Asp Ala Ser Asp He Val Gly Ala Ala Ala Ser 
275 



280 285 



Gly Asp Val Asn He He Phe Gly Thr Ser He Asn Glu Asp Leu Glu 



290 



295 



Asp Glu Val He Val Thr Val He Ala Thr Gly He Thr Gly Lys Asp 



305 



310 



Met Gly Glu Lys Ser Ser Lys Ser Ser Asn Arg Ser Gin Gly Pro Ser 

325 330 



Gin Lys Ser Gin Ala Arg Ser Ala Ser Glu Ser Ser Phe Ser Ser Trp 

340 345 

Gin Ser Asn Glu Arg Pro Gly Glu Asp Gin Asp Arg Pro Ser 



Gin Asn 

355 



360 365 



Ser 

370 



Gin Arg Arg Glu Val Asp Arg Ser Glu Asn Leu Phe Asn Asp Asp 



375 380 



Ser Lys Asp Gin Pro Ala Asp Ser Gly Asp Asp Asp Glu Leu Asp Thr 
385 390 395 

Pro Pro Phe Phe Arg Arg Arg Arg Lys Asn 

405 410 



<210> 85 
<211> 1377 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (13) . . (1377) 

<223> 
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acaaataata ga atg ttt tta gat atg gag gtt tea atg aat atg aaa aat 51 
acagataata ga ajr ^ ^ ^ ^ ^ ^ ^ ^ ^ Mefc Lyg Asn 



1 5 



ggg gtt tat aca age ctt gat att gga aee act tea ata aaa gta gtt 
Glv Val Tyr Thr Ser Leu Asp He Gly Thr Thr Ser He Lys Val Val 
15 20 25 

ni . r aot aaa gtt gat aat aat cag etc aaa gtt att gga gta gga aaa 
Val Ser Glu VaJ Lp Asn Asn Gin Leu Lys Val He Gly Val Gly Lys 
30 35 40 



get eaa tea aaa ggt tta aaa agg ggc atg gtt gtc .gat ata gat get 
La Gin Ser Lys Gly Leu Lys Arg Gly Met Val Val Asp He Asp Ala 

50 55 

acc ate cag gee att cat act gca gtg aag cag get get gat aag act 
Thr Val Gin A^a He His Thr Ala Val Lys Gin Ala Ala Asp Lys Thr 

"65 70 75 

ggt gtt atg ate aac cag etc att gtt gga gtt cct get aat ggt gtt 
Sy Val Me? He Asn Gin Leu He Val Gly Val Pro Ala Asn Gly Val 
80 85 

agt att gaa ccc tgt cac ggg gtc att act gta gat gac egg tec aag 
Ser He Glu Pro Cys His Gly Val He Thr Val Asp Asp Arg Ser Lys 
95 100 1° 5 



gaa ata gac age cag gaa gtg aac egg gta gtc aac. cag tec att get 
Glu He Asp Ser Gin Glu Val Asn Arg Val Val Asn Gin Ser He Ala 
110 H5 120 125 



aat ate gtt ccg cca gat aga gac tta tta tec gtc agt tta gaa gaa 
Asn 111 Val Pro Pro Asp Arg Asp Leu Leu Ser Val Ser Leu Glu Glu 

130 135 140 

ttt att gta gat ggt ttt gat gaa att cat gat ccg aga ggc atg gtg 
Phe He Val Asp Gly Phe Asp Glu He His Asp Pro Arg Gly Met Val 

145 15° 155 

ggc cag egg tta gaa ctt tac ggg aca gca att tea gtg cct aaa aca 
G?y Gin Arg Leu Glu Leu Tyr Gly Thr Ala lie Ser Val Pro Lys Thr 
160 165 I 70 

att tta cat aac att aga cgt tgt gtt gaa aaa gcg ggc tat caa att 
He Leu His Asn He Arg Arg Cys Val Glu Lys Ala Gly Tyr Gin He 
175 180 185 

get gee tta att etc cag ccc caa gee atg gee aag gta gee ttg tct 
Ala Ala Leu He Leu Gin Pro Gin Ala Met Ala Lys Val Ala Leu Ser 
190 195 200 205 

gag gat gag egg aat ttt ggt aca gtt atg gtg gat ata ggc gga ggt 
Glu Lp Glu Arg Asn Phe Gly Thr Val Met Val Asp He Gly Gly Gly 

210 215 220 



99 



147 



195 



243 . 



291 



339 



387 



435 



483 



531 



579 



627 



675 
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caa acg acc eta tea gec att cac gat gag caa gtg aag tat gec aat 
Gin Thr Thr Leu Ser Ala He His Asp Glu Gin Val Lys Tyr Ala Asn 

225 230 235 

gtg gtc caa gaa gec gga gaa tat att acc aaa gac att tec att gtc 
Va? Val Gin Glu Ala Gly Glu Tyr He Thr Lys Asp lie Ser He Val 
240 245 250 

ate aac acc tea cag caa aat gca gaa aag etc aaa aga gaa gtt ggg 
lie Asn Thr Ser Gin Gin Asn Ala Glu Lys Leu Lys Arg Glu Val Gly 

260 265 



723 



255 



gec att aaa agt cag tct gat tea act gtt caa gta gat gtt gta ggt 
Ala He Lys Ser Gin Ser Asp Ser Thr Val Gin Val Asp Val Val Gly 



caa aat gaa cct gtg aag att aaa gaa tec tat gtc ggt gaa att att 
G^n Asn Glu Pro Val Lys He Lys Glu Ser Tyr Val Gly Glu lie He 

290 295 300 



aaa gec egg gtt age caa ate ttt gaa aaa gtg aag get gac ctt gac 
llu l^a Arg Val Ser Gin He Phe Glu Lys Val Lys Ala Asp Leu Asp 

305 31° 315 

cca att aac gee ttc caa ttg cca ggt ggt gee gtt att tec ggc ggt 
Pro lie Asn Ala Phe Gin Leu Pro Gly Gly Ala Val lie Ser Gly Gly 
320 325 330 

tea get gec ata cca ggt att gac age ttg get gaa gac ate ttc aag 
Ser Ala La He Pro Gly He Asp Ser Leu Ala Glu Asp He Phe Lys 
335 340 345 

gtt egg tea gag etc tac att ccc gac tac atg ggt ate cga act ccc 
Val Arg Ser Glu Leu Tyr He Pro Asp Tyr Met Gly He Arg Thr Pro 
350 355 360 

gec ttc act gtg gca gtc ggc ttg acc etc tac caa gec cag act tct 
Ala Phe Thr Val Ala Val Gly Leu Thr Leu Tyr Gin Ala Gin Thr Ser 

370 375 380 

gat att gag egg gee ate aac cag tec ate ttg caa aat ate ggt att 
Asp He Glu Arg Ala He Asn Gin Ser He Leu Gin Asn lie Gly He 

385 390 395 

aat cca gat age cag cct get aac egg ata gtt gac cag gat gat tea 
Asn Pro Asp Ser Gin Pro Ala Asn Arg He Val Asp Gin Asp Asp Ser 
400 405 410 

gtc caa agt cag gac caa aag acg caa gat gag cca gca gga gac caa 
Val Gin Ser Gin Asp Gin Lys Thr Gin Asp Glu Pro Ala Gly Asp Gin 
415 420 425 

get agt cag teg gat agt cca gaa gaa ggc aat ttt aca gac aga ate 
Ala Ser Gin Ser Asp Ser Pro Glu Glu Gly Asn Phe Thr Asp Arg 
430 435 440 



771 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



1251 



1299 



1347 
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1377 

aag cat ttc ttt act aca ttt ttc gat taa 
Lys His Phe Phe Thr Thr Phe Phe Asp 

450 



<210> 86 
<211> 454 
<212> PRT 

<213> Alio io coccus otitidis 



Met°Phe 6 Leu Asp Met Glu Val Ser Met Asn Met Lys Asn Gly Val Tyr 
1 



5 10 15 



Thr Ser Leu Asp He Gly Thr Thr Ser He Lys Val Val Val Ser Glu 

20 25 30 

Val Asp Asn Asn Gin Leu Lys Val He Gly Val Gly Lys Ala Gin Ser 
35 40 45 

Lys Gly Leu Lys Arg Gly Met Val Val Asp He Asp Ala Thr Val Gin 
50 55 60 

Ala He His Thr Ala Val Lys Gin Ala Ala Asp Lys Thr Gly Val Met 
65 70 75 80 

He Asn Gin Leu He Val Gly Val Pro Ala Asn Gly Val Ser lie Glu 

85 90 95 

Pro Cys His Gly Val He Thr Val Asp Asp Arg Ser Lys Glu He Asp 

100 105 HO 

Ser Gin Glu Val Asn Arg Val Val Asn Gin Ser He Ala Asn He Val 
115 120 125 

Pro Pro Asp Arg Asp Leu Leu Ser Val Ser Leu Glu Glu Phe He Val 
130 135 140 

Asp Gly Phe Asp Glu He His Asp Pro Arg Gly Met Val Gly Gin Arg 
145 " 150 155 «0 

Leu Glu Leu Tyr Gly Thr Ala He Ser Val Pro Lys Thr He Leu His 

165 170 I 75 

Asn He Arg Arg Cys Val Glu Lys Ala Gly Tyr Gin He Ala Ala Leu 

180 185 190 
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He Leu Gin Pro Gin Ala Met Ala Lys Val Ala Leu Ser Glu Asp Glu 
195 200 205 

Arg Asn Phe Gly Thr Val Met Val Asp He Gly Gly Gly Gin Thr Thr 
210 215 220 

Leu Ser Ala He His Asp Glu Gin Val Lys Tyr Ala Asn Val Val Gin 
225 230 235 240 

Glu Ala Gly Glu Tyr He Thr Lys Asp He Ser He Val He Asn Thr 

245 250 255 

Ser Gin Gin Asn Ala Glu Lys Leu Lys Arg Glu Val Gly Ala He Lys 

260 265 270 

Ser Gin Ser Asp Ser Thr Val Gin Val Asp Val Val Gly Gin Asn Glu 
275 280 285 

Pro Val Lys He Lys Glu Ser Tyr Val Gly Glu He He Glu Ala Arg 
290 295 300 

Val Ser Gin He Phe Glu Lys Val Lys Ala Asp Leu Asp Pro He Asn 
305 310 315 320 

Ala Phe Gin Leu Pro Gly Gly Ala Val He Ser Gly Gly Ser Ala Ala 

325 330 335 

He Pro Gly He Asp Ser Leu Ala Glu Asp He Phe Lys Val Arg Ser 

340 345 350 

Glu Leu Tyr He Pro Asp Tyr Met Gly He Arg Thr Pro Ala Phe Thr 
355 360 365 

Val Ala Val Gly Leu Thr Leu Tyr Gin Ala Gin Thr Ser Asp He Glu 
370 375 380 

Arg Ala He Asn Gin Ser He Leu Gin Asn He Gly He Asn Pro Asp 
385 390 395 400 

Ser Gin Pro Ala Asn Arg He Val Asp Gin Asp Asp Ser Val Gin Ser 

405 410 415 
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Gin Asp Gin Lys Thr Gin Asp Glu Pro Ala Gly Asp Gin Ala Ser Gin 

420 425 430 

Ser Asp Ser Pro Glu Glu Gly Asn Phe Thr Asp Arg He Lys His Phe 
435 440 445 



Phe Thr Thr Phe Phe Asp 
450 



<210> 87 
<211> 1179 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (16) . . (1179) 

<223> 

<400> 87 

agcaaaggag caagt atg gaa act aaa aaa caa gca tta aaa gtt tta tta 

Met Glu Thr Lys Lys Gin Ala Leu Lys Val Leu Leu 
1 5 10 

tea ggc ggt gga aca ggt ggc cat ate tac cca gec ttg gec ctt get 
Ser Gly Gly Gly Thr Gly Gly His He Tyr Pro Ala Leu Ala Leu Ala 
15 " 20 25 

V 

aag cac eta get age tta cac tea gat gtc gag ttt ttg tat gtt ggc 
Lys His Leu Ala Ser Leu His Ser Asp Val Glu Phe Leu Tyr Val Gly 
30 35 40 

act caa agg gga ttg gaa aat aaa ttg gtc ccc caa gca gga ctt gac 
Thr Gin Arg Gly Leu Glu Asn Lys Leu Val Pro Gin Ala Gly Leu Asp 
45 50 55 60 

ttt ate ccg ate aaa gta gaa gga ttt age egg aag ttt aac ttc aaa 
Phe He Pro He Lys Val Glu Gly Phe Ser Arg Lys Phe Asn Phe Lys 

65 70 75 

age att aaa tat aat act aaa agt ctg att tat ttt eta aag gec ctg 
Ser He Lys Tyr Asn Thr Lys Ser Leu He Tyr Phe Leu Lys Ala Leu 

80 85 90 

agt aag tct aag caa ate ate aaa gac ttt cag cca gat gtg gta ata 
Ser Lys Ser Lys Gin He He Lys Asp Phe Gin Pro Asp Val Val He 
95 100 105 

ggg aca ggt ggt tat gtt tgt gee cct gtc ata tac cag gcg acc aag 
Gly Thr Gly Gly Tyr Val Cys Ala Pro Val He Tyr Gin Ala Thr Lys 
110 ** 115 120 



51 



99 



147 



195 



243 



291 



339 



387 



tta ggc att cca agt etc att cac gaa caa aat agt gtc gec ggg gtg 435 
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Leu Gly He Pro Ser Leu He His Glu Gin Asn Ser Val Ala Gly Val 
125 "0 135 140 

acc aat aag ttt ttg get egg tac gta gac aag att gec eta agt ttc 
Thr Asn Lys Phe Leu Ala Arg Tyr Val Asp Lys He Ala Leu Ser Phe 

145 150 155 

cag gaa get gaa aaa tec ttt gee aag tat aag gat aag ctg gtt ttg 
Gin Glu Ala Glu Lys Ser Phe Ala Lys Tyr Lys Asp Lys Leu Val Leu 

160 165 170 

act ggt aat cca aga gga cag gaa gtc age caa gtc aag ggt ggc ctt 
Thr Gly Asn Pro Arg Gly Gin Glu Val Ser Gin Val Lys Gly Gly Leu 

180 185 



17 5 



age etc eac aag tat ggc atg gac atg tec caa cct tea gta att att 
Ser Leu His Lys Tyr Gly Met Asp Met Ser Gin Pro Ser Val He lie 
190 195 200 

ttt ggt ggg tea agg ggg get tat get att aat aag gee ttt gtt gag 
Phe Gly Gly Ser Arg Gly Ala Tyr Ala He Asn Lys Ala Phe Val Glu 
205 ' 210 215 220 

gca tat agt caa ctg get gag agg gac tac cag gtc ttg ttt gtg ccg 
Ala Tyr Ser Gin Leu Ala Glu Arg Asp Tyr Gin Val Leu Phe Val Pro 

225 230 235 

gga tea get aat ttt age egg ata aaa cag gaa att gat aac cgc tat 
Gly Ser Ala Asn Phe Ser Arg He Lys Gin Glu He Asp Asn Arg Tyr 

240 245 250 

ggc cag cat aag ccg tea aac att ttt att gaa tec tat ate gat aac 
Gly Gin His Lys Pro Ser Asn He Phe He Glu Ser Tyr He Asp Asn 
255 260 265 

atg ccc caa gtt ttt aag get att gac ttg gtg gtt tgc cgt agt ggg 
Met Pro Gin Val Phe Lys Ala He Asp Leu Val Val Cys Arg Ser Gly 
270 275 280 

gee act acc eta gec gaa att atg tea tta ggc ttg gee age att tta 
Ala Thr Thr Leu Ala Glu He Met Ser Leu Gly Leu Ala Ser He Leu 
285 290 295 300 

att cca agt ccc aat gta acg get gac cac caa acc aaa aat get atg 
He Pro Ser Pro Asn Val Thr Ala Asp His Gin Thr Lys Asn Ala Met 

305 310 315 

agt ttg gtt aac caa caa get ggc tta atg att aag gaa aat gat eta 
Ser Leu Val Asn Gin Gin Ala Gly Leu Met He Lys Glu Asn Asp Leu 

320 325 330 

aat ggc caa age etc tta aac tgc tta gat gac ctg atg cat gat gac 
Asn Gly Gin Ser Leu Leu Asn Cys Leu Asp Asp Leu Met His Asp Asp 
335 340 345 

gca aaa aga aac aag atg gec caa caa gcg aaa gaa atg ggc caa ccc 
Ala Lys Arg Asn Lys Met Ala Gin Gin Ala Lys Glu Met Gly Gin Pro 



483 



531 



579 



627 



675 



723 



771 



819 



867 



915 



963 



1011 



1059 



1107 
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350 



355 360 



caa get tea gac aag ttg ate get etc ate ttg tec atg gtt aag gaa 1155 
Gin Ala Ser Asp Lys Leu lie Ala Leu lie Leu Ser Met Val Lys Glu 
365 " 370 375 380 



gat att aac tea gac ate gat taa 
Asp lie Asn Ser Asp lie Asp 

385 



<210> 88 
<211> 387 
<212> PRT 

<213> Alloiococcus otitidis 

<400> 88 _ _ 

Met Glu Thr Lys Lys Gin Ala Leu Lys Val Leu Leu Ser Gly Gly Gly 

15 10 15 

Thr Gly Gly His lie Tyr Pro Ala Leu Ala Leu Ala Lys His Leu Ala 

20 25 30 

Ser Leu His Ser Asp Val Glu Phe Leu Tyr Val Gly Thr. Gin Arg Gly 
35 40 45 

Leu Glu Asn Lys Leu Val Pro Gin Ala Gly Leu Asp Phe lie Pro lie 
50 55 60 

Lys Val Glu Gly Phe Ser Arg Lys Phe Asn Phe Lys Ser He Lys Tyr 
65 70 75 80 

Asn Thr Lys Ser Leu He Tyr Phe Leu Lys Ala Leu Ser Lys Ser Lys 

85 90 95 

Gin He He Lys Asp Phe Gin Pro Asp Val Val He Gly Thr Gly Gly 

100 105 HO 

Tyr Val Cys Ala Pro Val He Tyr Gin Ala Thr Lys Leu Gly He Pro 
115 120 125 

Ser Leu He His Glu Gin Asn Ser Val Ala Gly Val Thr Asn Lys Phe 
130 135 140 

Leu Ala Arg Tyr Val Asp Lys He Ala Leu Ser Phe Gin Glu Ala Glu 
145 ~ " 150 155 160 



1179 



WO 03/104391 PCT/US02/36122 

198/235 

Lys Ser Phe Ala Lys Tyr Lys Asp Lys Leu Val Leu Thr Gly Asn Pro 

165 170 

Arg Gly Gin Glu Val Ser Gin Val Lys Gly Gly Leu Ser Leu His Lys 

180 185 190 

Tyr Gly Met Asp Met Ser Gin Pro Ser Val He He Phe Gly Gly Ser 
195 200 205 

Arg Gly Ala Tyr Ala He Asn Lys Ala Phe Val Glu Ala Tyr Ser Gin 
210 215 220 

Leu Ala Glu Arg Asp Tyr Gin Val Leu Phe Val Pro Gly Ser Ala Asn 
225 230 235 240 

Phe Ser Arg He Lys Gin Glu He Asp Asn Arg Tyr Gly Gin His Lys 

245 250 2 " 

Pro Ser Asn He Phe He Glu Ser Tyr He Asp Asn Met Pro Gin Val 

260 265 2/0 

Phe Lys Ala He Asp Leu Val Val Cys Arg Ser Gly Ala Thr Thr Leu 
275 280 285 

Ala Glu He Met Ser Leu Gly Leu Ala Ser He Leu He Pro Ser Pro 
290 295 300 

Asn Val Thr Ala Asp His Gin Thr Lys Asn Ala Met Ser Leu Val Asn 
305 310 315 320 

Gin Gin Ala Gly Leu Met He Lys Glu Asn Asp Leu Asn Gly Gin Ser 

325 330 335 

Leu Leu Asn Cys Leu Asp Asp Leu Met His Asp Asp Ala Lys Arg Asn 

340 345 350 

Lys Met Ala Gin Gin Ala Lys Glu Met Gly Gin Pro Gin Ala Ser Asp 
355 360 365 

Lys Leu He Ala Leu He Leu Ser Met Val Lys Glu Asp He Asn Ser 
370 375 380 



Asp He Asp 
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385 



<210> 89 
<211> 1428 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (25) . . (1428) 

<223> 



Z^lU — s ; « K «. « « « 5; 

1 5 



aag gtt tta gtt tta ggc ttg gca aaa age ggg etc agt gcg gec cat 
Lys Val Leu Val Leu Gly Leu Ala Lys Ser Gly Leu Ser Ala Ala His 
10 15 20 2 



ttg tta aaa aaa eta ggg gec aag gtc ate gtc aat gac aag ttg gec 
III Leu Lys Lys Leu Gly Ala Lys Val He Val Asn Asp Lys Leu Ala 

30 35 



eta gaa aat aat acg gaa gec cag gtc tta att gaa gag ggc ttc caa 
III Glu Asn Asn Thr Glu Ala Gin Val Leu He Glu Glu Gly Phe Gin 

45 50 55 

ott ate acc ggc tac cac cca gag gat tta ctt gat gca age ttt gac 
?£ S! ^ Hy ryr His Pro Glu Asp Leu Leu Asp Ala Ser Phe Asp 
60 65 

ttt gtc gtc aag aat ccg ggc att cct tac acc aat cca gtg gta ggc 
Phe Val Val Lys Asn Pro Gly He Pro Tyr Thr Asn Pro Val Val Gly 
75 80 85 

cag get gaa aaa ctg get att ccc att tta act gaa gtg gac gtg gca 
Gin Ala Glu Lys Leu Ala He Pro He Leu Thr Glu Val Asp Val Ala 
90 95 100 



gga age ate tta aaa gec aag ccc ate get gtt acc ggg acc aat ggc 
lly Ser He Leu Lys Ala Lys Pro He Ala Val Thr Gly Thr Asn Gly 

110 115 

aag aca act acc gta tct tta att tat gat att tta gec caa gat caa 
lys Thr Thr Thr Val Ser Leu He Tyr Asp He Leu Ala Gin Asp Gin 

125 130 

gcg gaa age cct gaa cct aaa cca gtc tac aag eta ggc aat att ggc 
Ala Glu Ser Pro Glu Pro Lys Pro Val Tyr Lys Leu Gly Asn He Gly 
140 " 5 150 

caa ccg gtt agt gac ttg gec tta gaa att aaa get gaa tct aac ctg 
Gin Pro Val Ser Asp Leu Ala Leu Glu He Lys Ala Glu Ser Asn Leu 
155 ^ I 60 165 



99 



147 



195 



243 



29.1 



339 



387 



435 



483 



531 
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gtt gtc gaa etc tct agt ttc caa eta cag tea ctg acc tat ttc ace 
Val Val Glu Leu Ser Ser Phe Gin Leu Gin Ser Leu Thr Tyr Phe Thr 
- - 180 185 



170 175 



cct cat ata gca gtc att acc aat att tat tec gee cac ctt gac tac 
Pro His He Ala Val He Thr Asn He Tyr Ser Ala His Leu Asp Tyr 

190 195 200 

cat aag agt egg gag gaa tat gtt agg get aag eta agg att acc cag 
His Lys Ser Arg Glu Glu Tyr Val Arg Ala Lys Leu Arg lie Thr Gin 

205 210 215 

get caa ggt ccg gat gac tac eta gtc tac tac cag ggt cag gaa gaa 
Ala Gin Gly Pro Asp Asp Tyr Leu Val Tyr Tyr Gin Gly Gin Glu Glu 
220 225 230 

ttg get age ctg gtc aaa aaa tac tct aaa gee cag ctg gtc ccc tat 
Leu Ala Ser Leu Val Lys Lys Tyr Ser Lys Ala Gin Leu Val Pro Tyr 
235 240 245 

act gac aag ggt caa ctg aac caa gga gee tat ate aag gat gac tat 
Thr Asp Lys Gly Gin Leu Asn Gin Gly Ala Tyr He Lys Asp Asp Tyr 
250 255 260 265 

ctt ate tat aat caa gag cca gtc atg get tta gac cga gtt caa gtt 
Leu He Tyr Asn Gin Glu Pro Val Met Ala Leu Asp Arg Val Gin Val 

270 275 • 280 



egg ctt ttt gtc aac gac tct aag gca acc aat age ttg gee aca cag 
Arg Leu Phe Val Asn Asp Ser Lys Ala Thr Asn Ser Leu Ala Thr Gin 
330 335 340 345 

aag gca tta gaa gec tat gac caa gat acc ate ttg tta gtg ggt ggc 
Lys Ala Leu Glu Ala Tyr Asp Gin Asp Thr He Leu Leu Val Gly Gly 

350 355 360 

eta gac cgc caa gat gat ttt tec aag ctt gac cat get eta aac agg 
Leu Asp Arg Gin Asp Asp Phe Ser Lys Leu Asp His Ala Leu Asn Arg 

365 370 375 

gtt aag ggg gtc gtt tgt ttt ggc cag acc aaa gat aag tta gee egg 
Val Lys Gly Val Val Cys Phe Gly Gin Thr Lys Asp Lys Leu Ala Arg 
380 385 390 



579 



627 



675 



723 



771 



819 



867 



tct ggt age cac aac tta caa aat att tta gca get gtt tgc gta get 915 
Ser Gly Ser His Asn Leu Gin Asn He Leu Ala Ala Val Cys Val Ala 

285 290 295 



963 



aaa ata aag ggg etc tct aac caa acc att gee caa get gtc aac cac 

Lys He Lys Gly Leu Ser Asn Gin Thr He Ala Gin Ala Val Asn His 
300 305 310 

ttc aaa ggg gtt gee cac cgc age cag gtg gtt ggg egg tat gag gac 1011 

Phe Lys Gly Val Ala His Arg Ser Gin Val Val Gly Arg Tyr Glu Asp 
315 320 325 



1059 



1107 



1155 



1203 
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tat ttt aaa gac cgt cac att gag ggt gtt gag ctt gcc cag aca gtt 
Tyr Phe Lys Asp Arg His He Glu Gly Val Glu Leu Ala Gin Thr Val 

400 405 



rp , „ aa CTCa att gat ttg get tac gac ttg agt gag cca gga caa gtc 
Pro K Ala S Lp Leu Ala Tyr Asp Leu Ser Glu Pro Gly Gin Val 

410 415 420 

af , ttt tct cct get tgt gca agt tgg gac caa tat get aac ttt 

SS £eu Phe Ser Pro Ala Cys Ala Ser Trp Asp Gin Tyr Ala Asn Phe 



430 435 

~ rrrra <-*a crat tat att gat gca ate cag cag ctg gtt gaa 

gaa gag aga gga caa gat cac gcc gat ^ ^ ^ Ti<a „ Val Glu 

Gly 
445 



oa« eraa aaa aaa caa gat cac gcc gat y<-a — =» — « — - 

Glu Glu Arg G^y Gin Lp Tyr Val Asp Ala He Gin Gin Leu Val Glu 



aga eta gag caa agg age aag tat gga aac taa 
Arg Leu Glu Gin Arg Ser Lys Tyr Gly Asn 
460 465 



1251 



1299 



1347 



1395 



1428 



<210> 90 
<211> 467 
<212>. PRT 

<213> Alloiocbccus otitidis 

Me^Va^Asp Ser Val Phe Cys Asn Lys Lys Val Leu Val Leu Gly Leu 



10 



Ala Lys Ser Gly Leu Ser Ala Ala His Leu Leu Lys Lys Leu Gly Ala 

20 25 30 

Lys Val He Val Asn Asp Lys Leu Ala Leu Glu Asn Ash Thr Glu Ala 
35 40 45 

Gin Val Leu He Glu Glu Gly Phe Gin Val He Thr Gly Tyr His Pro 
50 55 60 

Glu Asp Leu Leu Asp Ala Ser Phe Asp Phe Val Val Lys Asn Pro Gly 
65 " 70 "75 



He Pro Tyr Thr Asn Pro Val Val Gly Gin Ala Glu Lys Leu Ala He 

85 9° 95 



Pro He Leu Thr Glu Val Asp Val Ala Gly Ser He Leu Lys Ala Lys 

100 105 110 

Pro He Ala Val Thr Gly Thr Asn Gly Lys Thr Thr Thr Val Ser Leu 
115 120 125 



WO 03/104391 



202/235 



PCT/US02/36122 



lie Tyr Asp He Leu Ala Gin Asp Gin Ala Glu Ser Pro Glu Pro Lys 
130 135 140 

Pro val Tyr Lys Leu Gly Asn He Gly Gin Pro Val Ser Asp Leu Ala 
145 150 155 

Leu Glu He Lys Ala Glu Ser Asn Leu Val Val Glu Leu Ser Ser Phe 

165 170 175 

Gin Leu Gin Ser Leu Thr Tyr Phe Thr Pro His He Ala Val He Thr 

180 185 I 90 

Asn He Tyr Ser Ala His Leu Asp Tyr His Lys Ser Arg Glu Glu Tyr 
195 200 205 

Val Arg Ala Lys Leu Arg He Thr Gin Ala Gin Gly Pro Asp Asp Tyr 
'210 215 220 



Leu Val Tyr Tyr Gin Gly Gin Glu Glu Leu Ala Ser Leu Val Lys Lys 
225 



230 235 240 



Tvr Ser Lys Ala Gin Leu Val Pro Tyr Thr Asp Lys Gly Gin Leu Asn 

245 250 255 

Gin Gly Ala Tyr He Lys Asp Asp Tyr Leu He Tyr Asn Gin Glu Pro 

260 265 270 

Val Met Ala Leu Asp Arg Val Gin Val Ser Gly Ser His Asn Leu Gin 
275 280 285 

Asn He Leu Ala Ala Val Cys Val Ala Lys He Lys Gly Leu Ser Asn 
290 295 300 

Gin Thr He Ala Gin Ala Val Asn His Phe Lys Gly Val Ala His Arg 
305 310 315 320 

Ser Gin Val Val Gly Arg Tyr Glu Asp Arg Leu Phe Val Asn Asp Ser 

325 330 335 

Lys Ala Thr Asn Ser Leu Ala Thr Gin Lys Ala Leu Glu Ala Tyr Asp . 

340 345 350 
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Gin Asp Tor He Leu Leu Val Gly Gly Leu Asp Arg Gin Asp Asp Phe 



355 



360 



Ser Lys Leu Asp His Ala Leu Asn Arg Val Lys Gly Val Val Cys Phe 
370 375 380 

Gly Gin Thr Lys Asp Lys Leu Ala Arg Tyr Phe Lys Asp Arg His lie 
385 390 . 395 

Glu Gly Val Glu Leu Ala Gin Thr Val Pro Qlu Ala Val Asp Leu Ala 

405 410 

Tyr Asp Leu Ser Glu Pro Gly Gin Val He Leu Phe Ser Pro Ala Cys 

425 



420 



Ala Ser Trp Asp Gin Tyr Ala Asn Phe Glu Glu Arg Gly Gin Asp Tyr 
435 440 445 

Val Asp Ala He Gin Gin Leu Val Glu Arg Leu Glu Gin Arg Ser Lys 

455 460 



450 



Tyr Gly Asn 
465 



<210> 91 
<211> 651 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (7) . . (651) 

<223> 



actagt^tg aag caa aaa act caa gcg aca gcg gtc aac cag acc caa 
atg aag ^ ^ ^ ^ ^ ^ ^ Val Asn Gln T hr Gin 

X 5 10 



aca gag gca gaa gaa aga caa gaa acc cgt egg aaa att ggc etc atg 
?hr Glu La Glu Glu Arg Gin Glu Thr Arg Arg Lys He Gly Leu Met 
15 20 25 

ggg ggg acc ttt aat ccg ccc cat ctg ggt cat tta atg gta get gaa 
35 G?y Thr Phe Asn Pro Pro His Leu Gly His Leu Leu Val Ala Glu 

35 40 

caa gtt tat gag gec ttg gac ttg gat aat att cac ttt atg ccc act 



48 



96 



144 



192 
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Gin Val Tyr Glu Ala Leu Asp Leu Asp Asn Xle His Phe Met Pro Thr 

50 55 SO 



gca aag ccg ggc cat gcc get ggt aag gaa acc ata gat gec tct tac 
Ala Lys Pro Gly His Ala Ala Gly Lys Glu Thr lie Asp Ala Ser Tyr 
65 ™ 75 ' 

egg gtt gat atg gtg gat tat gcc ate gaa gat aac ccc cac ttt tct 
Arg Val Asp Met Val Asp Tyr Ala lie Glu Asp Asn Pro His Phe Ser 
80 85 30 

ctt aac ttg act gaa gtg aac egg gga ggg aca act tac acc ate gat 
Leu Asn Leu Thr Glu Val Asn Arg Gly Gly Thr Thr Tyr Thr He Asp 
95 100 105 HO 

acc att aaa gaa ttg aaa gag get age ccg aat aca gat tat tac ttc 
Thr He Lys Glu Leu Lys Glu Ala Ser Pro Asn Thr Asp Tyr Tyr Phe 

115 120 125 

att att ggt gag gat tea gtt atg gat ttg gcc cag tgg aag aat att 
He He Gly Glu Asp Ser Val Met Asp Leu Ala Gin Trp Lys Asn lie 



130 



135 140 



gaa caa tta ctg gat tta gtt caa ttt gtt ggt gtg aag cga cca ggc 
Glu Gin Leu Leu Asp Leu Val Gin Phe Val Gly Val Lys Arg Pro Gly 
145 150 155 

tac caa get gat gtg gac ttt ccc att att tgg gtg gat acg cca gaa 
Tyr Gin Ala Asp Val Asp Phe Pro He He Trp Val Asp Thr Pro Glu 
160 165 170 

eta gat att agt tea agt gac ate agg caa agg gtg gca. gaa ggg caa 
Leu Asp He Ser Ser Ser Asp He Arg Gin Arg Val Ala Glu Gly Gin 
175 180 185 190 

tec att aaa tat ttg acc cca gat agg gta aga gat tat att gaa gac 
Ser He Lys Tyr Leu Thr Pro Asp Arg Val Arg Asp Tyr He Glu Asp 

195 . 200 205 

aat ggc tta tat aag ggt gaa gaa taa 
Asn Gly Leu Tyr Lys Gly Glu Glu 

210 



240 



288 



336 



384 



432 



480 



528 



576 



624 



651 



<210> 92 
<211> 214 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 92 

Met Lys Gin Lys Thr Gin Ala Thr Ala Val Asn Gin Thr Gin Thr Glu 
15 10 15 



Ala Glu Glu Arg Gin Glu Thr Arg Arg Lys He Gly Leu Met Gly Gly 

20 25 30 



r 
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Thr Phe Asn Pro Pro His Leu Gly His Leu Leu Val Ala Glu Gin Val 
35 40 45 

Tyr Glu Ala Leu Asp Leu Asp Asn He His Phe Met Pro Thr Ala Lys 
50 55 6° 

Pro Gly His Ala Ala Gly Lys Glu Thr He Asp Ala Ser Tyr Arg Val 
65 70 75 80 

Asp Met Val Asp Tyr Ala He Glu Asp Asn Pro His Phe Ser Leu Asn 

85 90 95 

Leu Thr Glu Val Asn Arg Gly Gly Thr Thr Tyr Thr He Asp Thr He 

100 105 11° 

Lys Glu Leu Lys Glu Ala Ser Pro Asn Thr Asp Tyr Tyr Phe He He 
115 120 125 

Gly Glu Asp Ser Val Met Asp Leu Ala Gin Trp Lys Asn He Glu Gin 
Y 130 135 1*0 

Leu Leu Asp Leu Val Gin Phe Val Gly Val Lys Arg Pro Gly Tyr Gin 
145 150 155 ibu 

Ala Asp Val Asp Phe Pro He He Trp Val Asp Thr Pro Glu Leu Asp 

165 170 17= 

He Ser Ser Ser Asp He Arg Gin Arg Val Ala Glu Gly Gin Ser He 

180 185 190 

Lys Tyr Leu Thr Pro Asp Arg Val Arg Asp Tyr He Glu Asp Asn Gly 
195 200 205 



Leu Tyr Lys Gly Glu Glu 
210 



<210> 93 
<211> 666 
<212> DNA 

<213> Alloiococcus otitidis 



<220> 

<221> CDS 

<222> (1) . . (666) 
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<223> 



<400> 93 

atg gta ggg gga etc att ttt gtc etc act gec age aat aaa agg aaa 
Met Val Gly Gly Leu He Phe Val Leu Thr Ala Ser Asn Lys Arg Lys 
1 5 10 15 

gga agt ttg tec atg ace tat ttg tta ggc eta acc ggt ggc att gee 
Glv Ser Leu Ser Met Tbr Tyr Leu Leu Gly Leu Thr Gly Gly He Ala 

20 25 30 

agt ggg aag tct act gtt age cag gtt ttt aag gaa aag ggt ate caa 
Ser Gly Lys Ser Thr Val Ser Gin Val Phe Lys Glu Lys Gly He Gin 
35 40 45 

gtg gtt gat get gac cga gtt gee cga cag gtt gtt gaa cct gga agt 
Val Val Asp Ala Asp Arg Val Ala Arg Gin Val Val Glu Pro Gly Ser 
50 55 60 



cca ggc tta gac cag ctt gtt gat tat ttt ggc cag gag att ttg acc 
Pro Gly Leu Asp Gin Leu Val Asp Tyr Phe Gly Gin Glu He Leu Thr 
65 "* 70 75 80. 



cag gat ggg ggc ttg gac cgc aaa tat tta ggc gac ctt ate ttc egg 
Gin Asp Gly Gly Leu Asp Arg Lys Tyr Leu Gly Asp Leu He Phe Arg 

85 90 95 

aat age cag gec aag gag get gtc aac egg ate etc cac cct ttg att 
Asn Ser Gin Ala Lys Glu Ala Val Asn Arg He Leu His Pro . Leu He 

100 105 11° 

agg cag tct ate caa aat caa att aaa act gee ata ggc caa gac ttg 
Arg Gin Ser He Gin Asn Gin He Lys Thr Ala lie Gly Gin Asp Leu 
115 120 125 

gat ttg tta gtt tta gac ate ccc etc ctt tac gag aca ggt cag gca 
Asp Leu Leu Val Leu Asp He Pro Leu Leu Tyr Glu Thr Gly Gin Ala 
130 13 5 140 



gac gac tac cag gee gtc atg gtg gtt teg ctt ccc tac cag gac cag 
Asp Asp Tyr Gin Ala Val Met Val Val Ser Leu Pro Tyr Gin Asp Glri 
145 ~ 150 155 160 

gtg agt egg tta atg gac egg gat ggg att gac cga gac caa gee ctg 
Val Ser Arg Leu Met Asp Arg Asp Gly He Asp Arg Asp Gin Ala Leu 

165- 170 I 75 

cgc aag att cag gee caa atg tea ttg gaa gaa aaa gtg aag ttg gcg 
Arg Lys He Gin Ala Gin Met Ser Leu Glu Glu Lys Val Lys Leu Ala 

180 1B5 190 

gac tat gtc att gat aac age gga age aag gaa gaa age cgt cag cag 
Asp Tyr Val He Asp Asn Ser Gly Ser Lys Glu Glu Ser Arg Gin Gin 
195 200 205 

gtt gaa get tgg ttg gat caa aag ggt ttt aaa aac ttg taa 
Val Glu Ala Trp Leu Asp Gin Lys Gly Phe Lys Asn Leu 



48 



96 



144 



19 



240 



288 



336 



384 



432 



480 



528 



576 



624 



666 
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210 



215 220 



<210> 94 
<211> 221 
<212> PRT 

<213> Alloiococcus otitidis 

<400> 94 _ 
Met Val Gly Gly Leu lie Phe Val Leu Thr Ala Ser Asn Lys Arg Lys 

1 5 10 15 

Gly Ser Leu Ser Met Thr Tyr Leu Leu Gly Leu Thr Gly Gly He Ala 

20 25 30 

Ser Gly Lys Ser Thr Val Ser Gin Val Phe Lys Glu Lys Gly He Gin 
35 40 45 

Val Val Asp Ala Asp Arg Val Ala Arg Gin Val Val Glu Pro Gly Ser 
50 55 60 

Pro Gly Leu Asp Gin Leu Val Asp Tyr Phe Gly Gin Glu He Leu Thr 
65 70 75 80 

Gin Asp Gly Gly Leu Asp Arg Lys Tyr Leu Gly Asp Leu He Phe Arg 

85 90 95 

Asn Ser Gin Ala Lys Glu Ala Val Asn Arg He Leu His Pro Leu He 

100 105 HO 

Arg Gin Ser He Gin Asn Gin He Lys Thr Ala He Gly Gin Asp Leu 
115 120 125 

Asp Leu Leu Val Leu Asp He Pro Leu Leu Tyr Glu Thr Gly Gin Ala 
130 135 140 

Asp Asp Tyr Gin Ala Val Met Val Val Ser Leu Pro Tyr Gin Asp Gin 
145 150 155 160 

Val Ser Arg Leu Met Asp Arg Asp Gly He Asp Arg Asp Gin Ala Leu 

165 170 175 

Arg Lys He Gin Ala Gin Met Ser Leu Glu Glu Lys Val Lys Leu Ala 

180 185 190 
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96 



144 



Asp Tyr Val He Asp Asn Ser Gly Ser Lys Glu Glu Ser Arg Gin Gin 
195 200 205 

Val Glu Ala Trp Leu Asp Gin Lys Gly Phe Lys Asn Leu 
210 215 220 

<210> 95 
<211> 1335 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (1335) 

<223> 

gg^atfgac caa gac acc ate tat cac ttt gtt ggc att aaa gga tct 48 
Me? Asp Gin Asp Thr He Tyr His Phe Val Gly He Lys Gly Ser 
1 5 10 15 

acc atg agt tea ctt gec act ate ttg ttt gac aag ggc tta aat gtc 
G?y Me? Ser Ser Leu Ala Thr He Leu Phe Asp Lys Gly Leu Asn Val 

20 25 30 

caa gga tct gat gtc aaa aag tat ttc ttt acc caa aaa age tta gaa 
Gin lly Ser Asp Val Lys Lys Tyr Phe Phe Thr Gin Lys Ser Leu Glu 

35 40 45 

gaa aaa aat ata aac att tta gaa ttt gac cct gat aac ate aaa cca 
Glu Lys Asn He Asn He Leu Glu Phe Asp Pro Asp Asn He Lys Pro 
50 55 60 

ggt atg acc ctg ata gca ggc aat gec ttt gga gae aac cat ccc gag 
Gly Met Thr Leu He Ala Gly Asn Ala Phe Gly Asp Asn His Pro Glu 
65 70 75 

ctg gtc cga ggt cga gag etc ggt tta gaa ate ate cgc tac cat gat 
III Val A?g lly Arg Glu Leu Gly Leu Glu lie He Arg Tyr Hxs Asp 
80 85 90 95 

ttt ate ggt gac ctt ate gaa cac ttt act tec ate get att acc ggg 
Phe He Gly Asp Leu He Glu His Phe Thr Ser He Ala He Thr Gly 

100 105 110 

tct cac ggt aag acc tec aca act ggt ttg atg gec cat gtt ttc tec 
Ser His Gly Lys Thr Ser Thr Thr Gly Leu Met Ala His Val Phe Ser 

115 120 125 

ggt att gat age acc tec tac tta att gga gat ggg acc ggc cat ggg 
lly He Asp Ser Thr Ser Tyr Leu He Gly Asp Gly Thr Gly His Gly 
X30 I 35 140 

gaa aaa ggt gec aag tat ttt gtc ttg gaa gec tgc gaa tac aag egg 
Glu Lys lly Ala Lys Tyr Phe Val Leu Glu Ala Cys Glu Tyr Lys Arg 



192 



240 



288 



336 



384 



432 



480 
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145 150 155 



^ n-t- tta acc tac cga ccg gac tat gcg gtt atg acc aat att gac 
His HI Leu HI Tyr Arg Pro Asp Tyr Ala Val Met Thr Asn He Asp. 
160 165 I 70 

,-t-t- aac cac cca gac tat tac aag tct att gaa gat gtc caa gtg gcc 
Phe Asp His Pro Asp Tyr Tyx Lys Ser lie Glu Asp Val Gin Val Ala 

180 185 

ttt gat gaa ttc age cac cag gtc aaa aaa tac etc ttt gcc tgc ggg 
Phe Lp Glu Phe Ser His Gin Val Lys Lys Tyr Leu Phe Ala Cys Gly 

195 

gac gac caa cgt ctt egg cag gtc aaa gcc cag gtg ccg gtc att tac 
Asp asp Gin Arg Leu Arg Gin Val Lys Ala Gin Val Pro Val He Tyr 
210 215 

tac ggt eta aat gaa gac aat gac ttt gtg get aaa aac ate gac cga 
Tyr 2J Leu Asn Glu Asp Asn Asp Phe Val Ala Lys Asn He Asp Arg 
■■ 225 230 235 

agt cgt gaa ggg tct gee ttc gac ctt tat att aag gga gaa ttt tac 
Ser Arg Glu Gly Ser Ala Phe Asp Leu Tyr lie Lys Gly Glu Phe Tyr 
240 245 - 5U 



aaa cac ttc acc ate cca acc tat ggc aac cac aat att caa aat gcc 
Lys £ Phe Thr He Pro Thr Tyr Gly Asn His Asn He Gin Asn Ala 



260 



ttg gcg gtt ata gca gta get tac tac gaa ggg tta gac caa gat ttg 
III III Val lie Ala Val Ala Tyr Tyr Glu Gly Leu Asp Gin Asp Leu 

275 280 

gtt gcc caa aga ttg get aat ttt get ggg gtg aaa cgc egg ttt acc 
Val Ala Gin Arg Leu Ala Asn Phe Ala Gly Val Lys Arg Arg Phe Thr 
290 295 30° 

gag aag gtg gtc ggg gac act act att ate gat gac tat get cac cac 
Glu Lys Val Val Gly Asp Thr Thr He He Asp Asp Tyr Ala His His 
305 310 315 



cct get gaa ata agg gca acg att gat gcg gcc egg caa aaa tac ccg 
Pro 
320 



ecu gen gaa ata ayy => — - - - - — - _ m,^ p r n 

Pro Ala Glu He Arg Ala Thr He Asp Ala Ala Arg Gin Lys Tyr Pro 

325 330 



aaa aac att gt g acg gtc ttc cag ccc cac acc ttt acc egg aca 
Asp Lys K S S3 Thr Val Phe Gin Pro His Thr Phe Thr Arg Thr 

340 345 

gtc gcc etc eta gat gaa ttt gcc cag gcc ttg gac ttg gca gac cag 
Val La Leu Leu Asp Glu Phe Ala Gin Ala Leu Asp Leu Ala Asp Gin 

355 360 365 

gtt tac ttg tgt gat ate ttt aat tea get aga gaa aag tea ggc gat 
Val Tyr Leu Cys Asp He Phe Asn Ser Ala Arg Glu Lys Ser Gly Asp 
370 375 380 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 
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att tec ate caa gat ctt ttg get aaa acc age aag gee gac cag gtg 
lie Ser lie Gin Asp Leu Leu Ala Lys Thr Ser Lys Ala Asp Gin Val 
385 390 395 

att gag gaa gac gat gtg tct cct ctg ctt gac caa cat ggg caa gtg 
He Glu Glu Asp Asp Val Ser Pro Leu Leu Asp Gin His Gly Gin Val 
400 405 410 415 

att att ttc atg gga gca gga gac ate age aag ttt gaa aaa gee tat 
He He Phe Met Gly Ala Gly Asp He Ser Lys Phe Glu Lys Ala Tyr 

420 425 430 

gaa age ttg ttg age tea acc tac cac tec cag gtc taa 
Glu Ser Leu Leu Ser Ser Thr Tyr His Ser Gin Val 

435 440 



<210> 96 
<211> 443 
<212> PRT 

<213> Alloiococcus otitidis 

<400> 96 ^ „ _ 

Met Asp Gin Asp Thr He Tyr His Phe Val Gly He Lys Gly Ser Gly 

15 10 I 5 

Met Ser Ser Leu Ala Thr He Leu Phe Asp Lys Gly Leu Asn Val Gin 

20 25 30 

Gly Ser Asp Val Lys Lys Tyr Phe Phe Thr Gin Lys Ser Leu Glu Glu 
35 40 " 45 

Lys Asn He Asn He Leu Glu Phe Asp Pro Asp Asn He Lys Pro Gly 
50 55 60 

Met Thr Leu He Ala Gly Asn Ala Phe Gly Asp Asn His Pro Glu Leu 
65 70 75 80 

Val Arg Gly Arg Glu Leu Gly Leu Glu He He Arg Tyr His Asp Phe 

85 90 3 5 

He Gly Asp Leu He Glu His Phe Thr Ser He Ala He Thr Gly Ser 

100 105 HO 

His Gly Lys Thr Ser Thr Thr Gly Leu Met Ala His Val Phe Ser Gly 
115 120 125 

He Asp Ser Thr Ser Tyr Leu He Gly Asp Gly Thr Gly His Gly Glu 



1200 



1248 



1296 



1335 



130 
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135 140 



Lys Gly Ala Lys Tyr Phe Val Leu Glu Ala Cys Glu Tyr Lys Arg His 
145 150 155 

Phe Leu Ala Tyr Arg Pro Asp Tyr Ala Val Met Thr Asn lie Asp Phe 

165 170 175 

Asp His Pro Asp Tyr Tyr Lys Ser lie Glu Asp Val Gin Val Ala Phe 

180 185 190 

Asp Glu Phe Ser His Gin Val Lys Lys Tyr Leu Phe Ala Cys Gly Asp 
19 5 200 205 

Asp Gin Arg Leu Arg Gin Val Lys Ala Gin Val Pro Val He Tyr Tyr 
210 215 220 

Gly Leu Asn Glu Asp Asn Asp Phe Val Ala Lys Asn He Asp Arg Ser 
1 n o n 935 240 

225 230 ZJD 

Arg Glu Gly Ser Ala Phe Asp Leu Tyr He Lys Gly Glu Phe Tyr Lys 

245 > 250 255 

His Phe Thr He Pro Thr Tyr Gly Asn His Asn He Gin Asn Ala Leu 

260 265 270 

Ala Val He Ala Val Ala Tyr Tyr Glu Gly Leu Asp Gin Asp Leu Val 
275 280 285 

Ala Gin Arg Leu Ala Asn Phe Ala Gly Val Lys Arg Arg Phe Thr Glu 
290 295 300 

Lvs Val Val Gly Asp Thr Thr He He Asp Asp Tyr Ala His His Pro 
305 310 315 320 

Ala Glu He Arg Ala Thr He Asp Ala Ala Arg Gin Lys Tyr Pro Asp 

325 330 335 

Lys Asp He Val Thr Val Phe Gin Pro His Thr Phe Thr Arg Thr Val 

340 345 350 



Ala Leu Leu Asp Glu Phe Ala Gin Ala Leu Asp Leu Ala Asp Gin 
355 360 365- 
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Tyr Leu Cys Asp lie Phe Asn Ser Ala Arg Glu Lys Ser Gly Asp lie 
370 375 380 

Ser lie Gin Asp Leu Leu Ala Lys Thr Ser Lys Ala Asp Gin Val lie 
385 390 395 400 

Glu Glu Asp Asp Val Ser Pro Leu Leu Asp Gin His Gly Gin Val lie 

405 410 415 

He Phe Met Gly Ala Gly Asp He Ser Lys Phe Glu Lys Ala Tyr Glu 

420 425 430 



Ser Leu Leu Ser Ser Thr Tyr His Ser Gin Val 
435 440 



<210> 97 
<211> 1050 
<212> D13A 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (19) . . (1050) 
<223> 

<400> 97 

acaaaattat ttacgtgt atg gag gaa tta ata gtg cca tta tta gac tta 

Met Glu Glu Leu lie Val Pro Leu Leu Asp Leu 
15 10 

aat gac cat gac cgc gtt cag gaa tat gag gac ttt gtc caa aac cac 
Asn Asp His Asp Arg Val Gin Glu Tyr Glu Asp Phe Val Gin Asn His 

15 20 25 



aag gca tgc ttg tec att eta tea gtc aaa aat gac gga gaa cat gec 
Lys Ala Cys Leu Ser He Leu Ser Val Lys Asn Asp Gly Glu His Ala 
60 65 70 75 

ttc tta tat gcg cca aga ggg ccg gtt tgt gac ttt cat gat aca gac 
Phe Leu Tyr Ala Pro Arg Gly Pro Val Cys Asp Phe His Asp Thr Asp 

80 85 90 



51 



99 



ccc cag ggc cac ctg atg cag tct acc aaa tgg ate cag gtt aag gaa 147 
Pro Gin Gly His Leu Met Gin Ser Thr Lys Trp lie Gin Val Lys Glu 
30 35 40 

ggc tgg gac ggt gac tat gtt tac ctt acc gat gac caa gac egg ate 195 
Gly Trp Asp Gly Asp Tyr Val Tyr Leu Thr Asp Asp Gin Asp Arg He 
45 50 55 



243 



291 
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ttg gtg acc gac tta att 
Leu Val Thr Asp Leu lie 

95 

aag gcc ttt ttg ttg egg 
Lys Ala Phe Leu Leu Arg 
110 

ctg gtc gaa aaa tac cgc 
Leu Val Glu Lys Tyr Arg 
125 



aag gaa gcc caa gtc gta 
Lys Glu Ala Gin Val Val 
100 

atg gac ccg gaa acc ctt 
Met Asp Pro Glu Thr Leu 
115 

gat tta ggc tat act ttc 
Asp Leu Gly Tyr Thr Phe 
130 135 



gcg gac aag cac 339 
Ala Asp Lys His 
105 

cat gat cct gac 387 

His Asp Pro Asp 

120 

egg tea get gag 435 
Arg Ser Ala Glu 



caa gaa gat gaa cac gtc ttc tec aac ccc cgc ttc cac atg atg acg 
Gin Glu Asp Glu His Val Phe Ser Asn Pro Arg Phe His Met Met Thr 
140 145 150 155 



483 



gac tta agg ggt cat gat gaa gaa age ttg ctg atg gcc ttc acc age 531 
Asp Leu Arg Gly His Asp Glu Glu Ser Leu Leu Met Ala Phe Thr Ser 

160 165 170 



aat aac egg cgc aag ate cgc aaa act tac aaa aat aac etc cag acc 
Asn Asn Arg Arg Lys lie Arg Lys Thr Tyr Lys Asn Asn Leu Gin Thr 

175 180 185 

cac tat ctg acc gtg gat gat gag ggt tat gac cag gcc ttg gat gac 
His Tyr Leu Thr Val Asp Asp Glu Gly Tyr Asp Gin Ala Leu Asp Asp 
190 195 200 

ttt tat gaa ttg acc caa ata atg gca gaa egg caa ggg att act cac 
Phe Tyr Glu Leu Thr Gin Xle Met Ala Glu Arg Gin Gly lie Thr His 
205 . 210 215 



ttg gtg age tat aat aaa aaa tec ttc tac atg tat gca get tct tec 
Leu Val Ser Tyr Asn Lys Lys Ser Phe Tyr Met Tyr Ala Ala Ser Ser 

255 260 265 

aac aaa aaa cga aat tta aat ggg tct ttg caa gaa aat tac gaa gcc 
Asn Lys Lys Arg Asn Leu Asn Gly Ser Leu Gin Glu Asn Tyr Glu Ala 
270 275 280 



gtc ttt ggc ttt gac aag teg gac ggc etc tac egg ttt aaa aaa ate 
Val Phe Gly Phe Asp Lys Ser Asp Gly Leu Tyr Arg Phe Lys Lys lie 
300 ^ 305 310 315 



579 



627 



675 



egg ccc aaa gac tac ttt gac egg tta atg cac age ttt gag gat get 723 
Arg Pro Lys Asp Tyr Phe Asp Arg Leu Met His Ser Phe Glu Asp Ala 
220 "* " 225 230 235 

aaa ttg ttc cag acc tac cac gaa gat gac etc eta get act tgt ate 771 
Lys Leu Phe Gin Thr Tyr His Glu Asp Asp Leu Leu Ala Thr Cys He 

240 245 250 



819 



867 



atg aag tat gcc ttg gcc cga gga age gaa gaa tat gat atg ggt ggg 915 
Met Lys Tyr Ala Leu Ala Arg Gly Ser Glu Glu Tyr Asp Met Gly Gly 
285 290 295 



963 



ttt acc ggt cat gaa ggg ctg aaa gaa ttt atg ggt gaa ttg gat gtg 



1011 
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Phe Thr Gly His Glu Gly Leu Lys GXu Phe Met Gly Glu Leu Asp Val 

320 325 330 

gtc tat gac caa gac eta tac gac gat ttt att tct taa 
Val Tyr Asp Gin Asp Leu Tyr Asp Asp Phe lie Ser 

335 340 



<210> 98 
<211> 343 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 98 

Met Glu Glu Leu lie Val Pro Leu Leu Asp Leu Asn Asp His Asp Arg 
15 10 15 



Val Gin Glu Tyr Glu Asp Phe Val Gin Asn His Pro Gin Gly His Leu 

20 25 30 



Met Gin Ser Thr Lys Trp He Gin Val Lys Glu Gly Trp Asp Gly Asp 
35 40 45 



Tyr Val Tyr Leu Thr Asp Asp Gin Asp Arg He Lys Ala Cys Leu Ser 
50 55 60 



He Leu Ser Val Lys Asn Asp Gly Glu His Ala Phe Leu Tyr Ala Pro 
65 70 75 80 



Arg Gly Pro Val Cys Asp Phe His Asp Thr Asp Leu Val Thr Asp Leu 

85 90 95 



He Lys Glu Ala Gin Val Val Ala Asp Lys His Lys Ala Phe Leu Leu 

100 105 110 



Arg Met Asp Pro Glu Thr Leu His Asp Pro Asp Leu Val Glu Lys Tyr 
115 . 120 125 



Arg Asp Leu Gly Tyr Thr Phe Arg Ser Ala Glu Gin Glu Asp Glu His 
130 135 140 



Val Phe Ser Asn Pro Arg Phe His Met Met Thr Asp Leu Arg Gly His 
145 150 155 160 



Asp Glu Glu Ser Leu Leu Met Ala Phe Thr Ser Asn Asn Arg Arg Lys 

165 170 175 



1050 
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lie Arg Lys Thr Tyr Lys Asn Asn Leu Gin Thr His Tyr Leu Thr Val 

180 185 190 



Asp Asp Glu Gly Tyr Asp Gin Ala Leu Asp Asp Phe Tyr Glu Leu Thr 
195 200 205 



Gin lie Met Ala Glu Arg Gin Gly lie Thr His Arg Pro Lys Asp Tyr 
210 215 220 



Phe Asp Arg Leu Met His Ser Phe Glu Asp Ala Lys Leu Phe Gin Thr 
225 230 235 240 



Tyr His Glu Asp Asp Leu Leu Ala Thr Cys He Leu Val Ser Tyr Asn 

245 250 255 



Lys Lys Ser Phe Tyr Met Tyr Ala Ala Ser Ser Asn Lys Lys Arg Asn 

260 265 270 



Leu Asn Gly Ser Leu Gin Glu Asn Tyr Glu Ala Met Lys Tyr Ala Leu 
275 280 285 



Ala Arg Gly Ser Glu Glu Tyr Asp Met Gly Gly Val Phe Gly Phe Asp 
290 295 300 



Lys Ser Asp Gly Leu Tyr Arg Phe Lys Lys He Phe Thr Gly His Glu 
305 310 315 320 



Gly Leu Lys Glu Phe Met Gly Glu Leu Asp Val Val Tyr Asp Gin Asp 

325 330 335 



Leu Tyr Asp Asp Phe He Ser 

340 



<210> 99 
<211> 2244 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (22) . . (2244) 

<223> 

<400> 99 

ttacgtgaaa ggaagacttg c atg ggc eta gca aaa gat att tta ggc aaa 51 
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Met Gly Leu Ala Lys Asp He Leu Gly Lys 
15 10 

atg aat gac aaa caa aaa caa gcg gtc atg acc act gat ggc cct etc 
Met Asn Asp Lys Gin Lys Gin Ala Val Met Thr Thr Asp Gly Pro Leu 

15 20 25 



egg ata get tac ttg ate caa gaa aaa ggg gtt aat cct tgg aat ate 
Arg He Ala Tyr Leu He Gin Glu Lys Gly Val Asn Pro Trp Asn He 
45 50 55 

tta gee ate acc ttt acc aac aag gcg get ggc gag atg aaa gac egg 
Leu Ala He Thr Phe Thr Asn Lys Ala Ala Gly Glu Met Lys Asp Arg 
60 65 70 



ttc cac tct atg tgt gtt cgc att eta aga agg gac ggg gac caa att 
Phe His Ser Met Cys Val Arg He Leu Arg Arg Asp Gly Asp Gin He 

95 100 105 

ggc tat aac cgt gee ttc acc att get gac cct agt gaa cag aaa agt 
Gly Tyr Asn Arg Ala Phe Thr He Ala Asp Pro Ser Glu Gin Lys Ser 

110 115 120 

ttg atg aag cag gtc tta aaa gac ttg aat att gat cct aaa cgt tac 
Leu Met Lys Gin Val Leu Lys Asp Leu Asn He Asp Pro Lys Arg Tyr 
125 130 135 

aac ccc aag gcg ata ttg gee gag att tec aat gee aaa aat gac etc 
Asn Pro Lys Ala He Leu Ala Glu He Ser Asn Ala Lys Asn Asp Leu 
140 " 145 150 



gtg gtg get gac tgc tac gat get tac caa aga cag etc cgc cag tct 
Val Val Ala Asp Cys Tyr Asp Ala Tyr Gin Arg Gin Leu Arg Gin Ser 

175 180 185 

gag gee atg gac ttt gac gac ctg att atg caa acc gtc cgt etc ttc 
Glu Ala Met Asp Phe Asp Asp Leu He Met Gin Thr Val Arg Leu Phe 

190 195 200 



99 



ttg ate atg get ggg gca gga tct ggc aag acc egg gtc tta acc cac 147 
Leu He Met Ala Gly Ala Gly Ser Gly Lys Thr Arg Val Leu Thr His 

30 35 40 



195 



243 



gtc cag aaa ctg gtt age cag gga gga tct gga gtt tgg gtc teg act 291 
Val Gin Lys Leu Val Ser Gin Gly Gly Ser Gly Val Trp Val Ser Thr 
75 80 85 90 



339 



387 



435 



483 



ttg gat gag caa acc tac egg aaa caa get gat gac tat ttt aag gaa . . 5 ^1 

Leu Asp Glu Gin Thr Tyr Arg Lys Gin Ala Asp Asp Tyr Phe Lys Glu 
155 160 165 170 



579 



627 



aag gaa aag ccc gat acc ctg tct tac tac cag gee aag ttc cag tat 675 
Lys Glu Lys Pro Asp Thr Leu Ser Tyr Tyr Gin Ala Lys Phe Gin Tyr 
205 210 215 



ate cat gtt gac gaa tac cag gat acc aac caa gee caa tac caa ctg 
He His Val Asp Glu Tyr Gin Asp Thr Asn Gin Ala Gin Tyr Gin Leu 



723 
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220 225 230 

gtt caa ctg eta gec caa cgc ttt aaa aat gtt tgc gtc gtg gga gat 
Val Gin Leu Leu Ala Gin Arg Phe Lys Asn Val Cys Val Val Gly Asp 
235 240 245 250 

get gac cag tct att tat ggt tgg egg ggg get gat atg gga aat att 
Ala Asp Gin Ser lie Tyr Gly Trp Arg Gly Ala Asp Met Gly Asn lie 

255 260 265 

ttg aat ttc gaa aaa gac tat cca gaa gee caa ace ate ttt ttg gaa 
Leu Asn Phe Glu Lys Asp Tyr Pro Glu Ala Gin Thr He Phe Leu Glu 

270 275 280 

caa aat tac egg tea ace aag tct ata ate agg gca gee aat gat gtt 
Gin Asn Tyr Arg Ser Thr Lys Ser He He Arg Ala Ala Asn Asp Val 
285 290 295 

ate caa aac aat ate aac cgc egg gac aag aat ttg tgg act gee aac 
He Gin Asn Asn He Asn Arg Arg Asp Lys Asn Leu Trp Thr Ala Asn 
300 305 310 

gat gag ggg gac aag gtc age tta tac get gec egg age gag cag gat 
Asp Glu Gly Asp Lys Val Ser Leu Tyr Ala Ala Arg Ser Glu Gin Asp 
315 320 325 330 

gaa gec cag ttt ate gta ggg acc ate cat gac eta aca gaa ggc aaa 
Glu Ala Gin Phe He Val Gly Thr He His Asp Leu Thr Glu Gly Lys 

335 340 345 

aag get ggc tat ggg gac ate gec ate etc tac egg acc aat gec atg 
Lys Ala Gly Tyx Gly Asp He Ala He Leu Tyr Arg Thr Asn Ala Met 

350 355 360 

tec egg gtt att gaa gaa acc ttt ate aag teg aat ate ccc tac aag 
Ser Arg Val He Glu Glu Thr Phe lie Lys Ser Asn He Pro Tyr Lys 
365 370 375 

ate gtc ggc gga acc ggc ttt tac caa aga aaa gaa ate cgt gac ctg 
He Val Gly Gly Thr Gly Phe Tyr Gin Arg Lys Glu lie Arg Asp Leu 
380 385 390 

att gee tac eta acc eta gtg get aac cca get gat gac ctg tec ttt 
He Ala Tyr Leu Thr Leu Val Ala Asn Pro Ala Asp Asp Leu Ser Phe 
395 400 405 410 

tea egg ate gtt aat gag ccc aaa aga ggg att gga ccc ggc acc ctg 
Ser Arg He Val Asn Glu Pro Lys Arg Gly He Gly Pro Gly Thr Leu 

415 420 425 

gac aag tta cgc cag get ggc cag gag atg ggt tgg teg ctt tac gaa 
Asp Lys Leu Arg Gin Ala Gly Gin Glu Met Gly Trp Ser Leu Tyr Glu 

430 435 440 

aca get etc aat gcg gat get acc aac ctg cct agt egg get gtc aac 
Thr Ala Leu Asn Ala Asp Ala Thr Asn Leu Pro Ser Arg Ala Val Asn 
445 450 455 
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aga eta tta gac ttc agt caa atg att gaa aat ttc agg aaa atg acg 1443 
Arg Leu Leu Asp Phe Ser Gin Met lie Glu Asn Phe Arg Lys Met Thr 
460 465 470 

gaa tac tta ccg att act gat ttg acc gaa aaa ate tta gag gat act 1491 
Glu Tyr Leu Pro lie Thr Asp Leu Thr Glu Lys lie Leu Glu Asp Thr 
475 480 485 490 

ggc tac caa aaa gec tta gaa aaa gac egg act ctt gaa tct cag gca 1539 
Gly Tyr Gin Lys Ala Leu Glu Lys Asp Arg Thr Leu Glu Ser Gin Ala 

495 500 505 

agg tta gag aac eta cag gaa ttt tac tec gtc acc cag gaa ttt gac 1587 
Arg Leu Glu Asn Leu Gin Glu Phe Tyr Ser Val Thr Gin Glu Phe Asp 

510 515 520 

cag caa gaa gac gac aac aag tea etc tta gee ttc tta act gac ctt 163 5 

Gin Gin Glu Asp Asp Asn Lys Ser Leu Leu Ala Phe Leu Thr Asp Leu 
525 530 535 

tec tta ttg tea cca get gat gat gtt gaa gag ggt egg ggc cag gtc 1683 
Ser Leu Leu Ser Pro Ala Asp Asp Val Glu Glu Gly Arg Gly Gin Val 
540 545 550 

acc atg atg acc etc cat gca gee aag ggg ttg gaa ttc ccc tat gtc 1731 
Thr Met Met Thr Leu His Ala Ala Lys Gly Leu Glu Phe Pro Tyr Val 
555 560 565 570 

ttt ate get ggt atg gaa gag gga ate ttc ccc ttg tec egg gcg get 1779 
Phe lie Ala Gly Met Glu Glu Gly lie Phe Pro Leu Ser Arg Ala Ala 

575 580 585 

gaa gac ccg gaa age ttg gaa gaa gag cga cga ctg gee tat gta ggg 1827 
Glu Asp Pro Glu Ser Leu Glu Glu Glu Arg Arg Leu Ala Tyr Val Gly 

590 595 600 

att acc egg get gag cag gee etc tac eta acc cgt gec atg atg cgc 1875 
He Thr Arg Ala Glu Gin Ala Leu Tyr Leu Thr Arg Ala Met Met Arg 
605 610 615 

caa etc tat ggc egg acc cag get aat ccc aaa tct cgc ttt tta tct 1923 
Gin Leu Tyr Gly Arg Thr Gin Ala Asn Pro Lys Ser Arg Phe Leu Ser 
620 625 630 

gaa att tct tct gac ctg gtc caa gac ctt ggt get aca act ggg tct 1971 
Glu He Ser Ser Asp Leu Val Gin Asp Leu Gly Ala Thr Thr Gly Ser 
635 640 645 650 

ctt age cag act ggg ggg aaa gtt age cct aga eta gga ggc cgc aaa 2019 
Leu Ser Gin Thr Gly Gly Lys Val Ser Pro Arg Leu Gly Gly Arg Lys 

655 660 665 

gec agt ggt tat aag get aat get tgg tct cag caa tea gtt ggg gcg 2 067 

Ala Ser Gly Tyr Lys Ala Asn Ala Trp Ser Gin Gin Ser Val Gly Ala 

670 675 680 
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act ggg get gaa aaa gaa gac tgg gaa gtt ggt gac aag gtc cac cac 2115 
Thr Gly Ala Glu Lys Glu Asp Trp Glu Val Gly Asp Lys Val His His 
685 690 695 



aaa aaa tgg ggc caa gga acc att att gag att aaa ggt tct ggc teg 

Lys Lys Trp Gly Gin Gly Thr He He Glu lie Lys Gly Ser Gly Ser 

700 705 710 

gac etc cag etc aac att gec ttt cca gat gaa ggg ate aag ccc ttg 

Asp Leu Gin Leu Asn He Ala Phe Pro Asp Glu Gly He Lys Pro Leu 

715 720 725 730 

eta gec agt ttt gee ccc ate gaa aag att tag 
Leu Ala Ser Phe Ala Pro He Glu Lys He 

735 740 



<210> 100 
<211> 740 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 100 

Met Gly Leu Ala Lys Asp He Leu Gly Lys Met Asn Asp Lys Gin Lys 
1 5 ' 10 15 

Gin Ala Val Met Thr Thr Asp Gly Pro Leu Leu He Met Ala Gly Ala 

20 25 30 



Gly Ser Gly Lys Thr Arg Val Leu Thr His Arg He Ala Tyr Leu He 
35 40 45 



Gin Glu Lys Gly Val Asn Pro Trp Asn He Leu Ala He Thr Phe Thr 
50 55 60 



Asn Lys Ala Ala Gly Glu Met Lys Asp Arg Val Gin Lys Leu Val Ser 
65 "* 70 75 80 



Gin Gly Gly Ser Gly Val Trp Val Ser Thr Phe His Ser Met Cys Val 

85 90 95 



Arg He Leu Arg Arg Asp Gly Asp Gin He Gly Tyr Asn Arg Ala Phe 

100 105 110 



Thr He Ala Asp Pro Ser Glu Gin Lys Ser Leu Met Lys Gin Val Leu 
115 120 125 



2163 



2211 



2244 



Lys Asp Leu Asn He Asp Pro Lys Arg Tyr Asn Pro Lys Ala He Leu 
130 135 140 
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Ala Glu lie Ser Asn Ala Lys Asn Asp Leu Leu Asp Glu Gin Thr Tyr 
145 150 155 160 



Arg Lys Gin Ala Asp Asp Tyr Phe Lys Glu Val Val Ala Asp Cys Tyr 

165 170 175 



Asp Ala Tyr Gin Arg Gin Leu Arg Gin Ser Glu Ala Met Asp Phe Asp 

180 185 190 



Asp Leu lie Met Gin Thr Val Arg Leu Phe Lys Glu Lys Pro Asp Thr 
195 200 205 



Leu Ser Tyr Tyr Gin Ala Lys Phe Gin Tyr lie His Val Asp Glu Tyr 
210 215 220 



Gin Asp Thr Asn Gin Ala Gin Tyr Gin Leu Val Gin Leu Leu Ala Gin 
225 230 235 240 



Arg Phe Lys Asn Val Cys Val Val Gly Asp Ala Asp Gin Ser lie Tyr 

245 250 255 



Gly Trp Arg Gly Ala Asp Met Gly Asn lie Leu Asn Phe Glu Lys Asp 

260 265 270 



Tyr Pro Glu Ala Gin Thr lie Phe Leu Glu Gin Asn Tyr Arg Ser Thr 
275 280 285 



Lys Ser lie lie Arg Ala Ala Asn Asp Val lie Gin Asn Asn lie Asn 
290 295 300 



Arg Arg Asp Lys Asn Leu Trp Thr Ala Asn Asp Glu Gly Asp Lys Val 
305 310 315 320 



Ser Leu Tyr Ala Ala Arg Ser Glu Gin Asp Glu Ala Gin Phe He Val 

325 330 335 



Gly Thr He His Asp Leu Thr Glu Gly Lys Lys Ala Gly Tyr Gly Asp 

340 345 350 



He Ala He Leu Tyr Arg Thr Asn Ala Met Ser Arg Val He Glu Glu 
355 " 360 " 365 
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Thr Phe lie Lys Ser Asn lie Pro Tyr Lys lie Val Gly Gly Thr Gly 
370 375 380 



Phe Tyr Gin Arg Lys Glu lie Arg Asp Leu lie Ala Tyr Leu Thr Leu 
385 390 395 400 



Val Ala Asn Pro Ala Asp Asp Leu Ser Phe Ser Arg lie Val Asn Glu 

405 410 415 



Pro Lys Arg Gly lie Gly Pro Gly Thr Leu Asp Lys Leu Arg Gin Ala 

420 425 430 



Gly Gin Glu Met Gly Trp Ser Leu Tyr Glu Thr Ala Leu Asn Ala Asp 
435 440 445 



Ala Thr Asn Leu Pro Ser Arg Ala Val Asn Arg Leu Leu Asp Phe Ser 
450 455 460 



Gin Met lie Glu Asn Phe Arg Lys Met Thr Glu Tyr Leu Pro He Thr 
465 470 475 480 



Asp Leu Thr Glu Lys He Leu Glu Asp Thr Gly Tyr Gin Lys Ala Leu 

485 490 495 



Glu Lys Asp Arg Thr Leu Glu Ser Gin Ala Arg Leu Glu Asn Leu Gin 

500 505 510 



Glu Phe Tyr Ser Val Thr Gin Glu Phe Asp Gin Gin Glu Asp Asp Asn 
515 • 520 525 



Lys Ser Leu Leu Ala Phe Leu Thr Asp Leu Ser Leu Leu Ser Pro Ala 
530 535 540 



Asp Asp Val Glu Glu Gly Arg Gly Gin Val Thr Met Met Thr Leu His 
545 550 555 560 



Ala Ala Lys Gly Leu Glu Phe Pro Tyr Val Phe He Ala Gly Met Glu 

565 570 575 



Glu Gly He Phe Pro Leu Ser Arg Ala Ala Glu Asp Pro Glu Ser Leu 

580 585 590 
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Glu Glu Glu Arg Arg Leu Ala Tyr Val Gly He Thr Arg Ala Glu Gin 
595 600 605 



Ala Leu Tyr Leu Thr Arg Ala Met Met Arg Gin Leu Tyr Gly Arg Thr 
610 615 620 



Gin Ala Asn Pro Lys Ser Arg Phe Leu Ser Glu He Ser Ser Asp Leu 
625 630 635 640 



Val Gin Asp Leu Gly Ala Thr Thr Gly Ser Leu Ser Gin Thr Gly Gly 

645 650 655 



Lys Val Ser Pro Arg Leu Gly Gly Arg Lys Ala Ser Gly Tyr Lys Ala 

660 665 670 



Asn Ala Trp Ser Gin Gin Ser Val Gly Ala Thr Gly Ala Glu Lys Glu 
675 680 685 



Asp Trp Glu Val Gly Asp Lys Val His His Lys Lys Trp Gly Gin Gly 
690 695 700 



Thr He He Glu He Lys Gly Ser Gly Ser Asp Leu Gin Leu Asn lie 
705 710 715 720 



Ala Phe Pro Asp Glu Gly He Lys Pro Leu Leu Ala Ser Phe Ala Pro 

725 730 735 



He Glu Lys He 

740 



<210> 101 
<211> 1314 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (4) . . (1314) 

<223> 

<400> 101 

agt atg gac aca ate gtc att caa gga gga gac aat cga ctt gag ggt 48 

Met Asp Thr He Val He Gin Gly Gly Asp Asn Arg Leu Glu Gly 
1 5 10 15 



aca gtc aag gta gaa ggg get aag aat get gec ctt cct ate ctg get 96 
Thr Val Lys Val Glu Gly Ala Lys Asn Ala Ala Leu Pro He Leu Ala 
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20 25 30 

gcc agt ctt tta cca gaa gat ggg aaa agt cac ctg tec aat gtc ccc 144 
Ala Ser Leu Leu Pro Glu Asp Gly Lys Ser His Leu Ser Asn Val Pro 

35 40 45 

tta eta tct gat att tac acg atg caa gaa gtt ttg cgt tac tta aac 192 
Leu Leu Ser Asp lie Tyr Thr Met Gin Glu Val Leu Arg Tyr Leu Asn 
50 55 60 

gtt gac att gac ttc gat gaa gac cac aac gaa ate gtc ata gat get 240 
Val Asp lie Asp Phe Asp Glu Asp His Asn Glu lie Val He Asp Ala 
65 70 75 

aca gga gac ctg aat tec aat ace cct tat gaa ttt atg age aag atg 288 
Thr Gly Asp Leu Asn Ser Asn Thr Pro Tyr Glu Phe Met Ser Lys Met 
80 85 90 95 

egg get tec ate att gtc atg ggt ccc tta eta gcc cgt aat ggt tat 336 
Arg Ala Ser He lie Val Met Gly Pro Leu Leu Ala Arg Asn Gly Tyr 

100 105 110 

gcc aaa gtc get ctt cct ggt ggt tgc gcg att ggg act cgt cct att 384 
Ala Lys Val Ala Leu Pro Gly Gly Cys Ala He Gly Thr Arg Pro He 

115 120 125 

gac ttg cac tta aaa ggc ttc egg get atg ggg gtc gat gtg gaa gtc 432 
Asp Leu His Leu Lys Gly Phe Arg Ala Met Gly Val Asp Val Glu Val 
130 135 140 

gaa gga ggt tat gtg ate gcc aca gtt caa gat gaa ctg gat ggc get 480 
Glu Gly Gly Tyr Val He Ala Thr Val Gin Asp Glu Leu Asp Gly Ala 
145 150 155 

gat att tac ctt gac ttc cca agt gtt gga get aca caa aat att ttg 528 
Asp He Tyr Leu Asp Phe Pro Ser Val Gly Ala Thr Gin Asn He Leu 
160 165 170 175 

atg get gcc ace egg gca aaa ggg aca aca gtc ate gag aat gca get 57 6 

Met Ala Ala Thr Arg Ala Lys Gly Thr Thr Val He Glu Asn Ala Ala 

180 185 190 

cga gaa cct gaa att gtt gac ctt gcc aac tat ttg aac aag atg ggt 624 
Arg Glu Pro Glu He Val Asp Leu Ala Asn Tyr Leu Asn Lys Met Gly 

195 200 205 

gcc cgt att tac ggg gcc gga acc aat ace atg aga att gaa ggg gta 672 
Ala Arg He Tyr Gly Ala Gly Thr Asn Thr Met Arg He Glu Gly Val 
210 . 215 220 

gac aag eta gaa get tgt gac cac tec att att gcc gac egg ata gaa 720 
Asp Lys Leu Glu Ala Cys Asp His Ser He He Ala Asp Arg He Glu 
225 230 235 

agt ggc acc ttt atg gta gca get ggt gtc acc caa ggg aat gtc ttg 768 
Ser Gly Thr Phe Met Val Ala Ala Gly Val Thr Gin Gly Asn Val Leu 
240 245 250 255 
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att gaa gac tgt ate gtc gaa cac aac cgc ccc tta att tec aag tta 816 
lie Glu Asp Cys He Val Glu His Asn Arg Pro Leu He Ser Lys Leu 

260 265 270 

agt gaa atg ggc gtt caa ttt gag gaa gaa aaa ace ggc ctt cga gtc 864 
Ser Glu Met Gly Val Gin Phe Glu Glu Glu Lys Thr Gly Leu Arg Val 

275 280 285 

atg gga cca gag ace tta cag gca aca gat gtt aaa acc ctg cct tat 912 
Met Gly Pro Glu Thr Leu Gin Ala Thr Asp Val Lys Thr Leu Pro Tyr 
290 295 300 

cct ggc ttc cca act gat atg cag tea ccg atg aca gtc gec caa acc 960 
Pro Gly Phe Pro Thr Asp Met Gin Ser Pro Met Thr Val Ala Gin Thr 
305 310 315 

eta get gag gga aga age ate atg aga gaa acg gtc ttc gaa aac cgc 1008 
Leu Ala Glu Gly Arg Ser He Met Arg Glu Thr Val Phe Glu Asn Arg 
320 325 330 335 

ttc atg cac atg gaa gag ctt cgt aaa atg gat gca caa ttt act gtc 1056 
Phe Met His Met Glu Glu Leu Arg Lys Met Asp Ala Gin Phe Thr Val 

340 345 350 

gat ggc cag tec ctt att ate gag ggg ggc aaa aaa etc caa ggt get 1104 
Asp Gly Gin Ser Leu He He Glu Gly Gly Lys Lys Leu Gin Gly Ala 

355 360 365 

aga gtc cag tec agt gac ttg egg get tea get tec ttg att att get 1152 
Arg Val Gin Ser Ser Asp Leu Arg Ala Ser Ala Ser Leu He He Ala 
370 375 380 

ggt tta gta get gat ggt gtc acc aaa gta acc aat ctt aac cac tta 12 00 

Gly Leu Val Ala Asp Gly Val Thr Lys Val Thr Asn Leu Asn His Leu 
385 390 395 

gac egg ggc tac tat aaa ttt cac gaa aaa tta cag caa tta ggt get 1248 
Asp Arg Gly Tyr Tyr Lys Phe His Glu Lys Leu Gin Gin Leu Gly Ala 
400 405 410 415 

tec att gaa cga ate gac gag gaa att caa gtt gac cag gaa gec age 129 6 

Ser He Glu Arg He Asp Glu Glu He Gin Val Asp Gin Glu Ala Ser 

420 425 430 

etc aaa aaa ggc gaa taa 1314 
Leu Lys Lys Gly Glu 

435 



<210> 102 
<211> 436 
<212> PRT 

<213> Alloiococcus otitidis 



<400> 102 

Met Asp Thr He Val He Gin Gly Gly Asp Asn Arg Leu Glu Gly Thr 
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10 15 



Val Lys Val Glu Gly Ala Lys Asn Ala Ala Leu Pro lie Leu Ala Ala 

20 25 30 



Ser Leu Leu Pro Glu Asp Gly Lys Ser His Leu Ser Asn Val Pro Leu 
35 40 45 



Leu Ser Asp lie Tyr Thr Met Gin Glu Val Leu Arg Tyr Leu Asn Val 
50 55 60 



Asp lie Asp Phe Asp Glu Asp His Asn Glu lie Val lie Asp Ala Thr 
65 70 75 80 



Gly Asp Leu Asn Ser Asn Thr Pro Tyr Glu Phe Met Ser Lys Met Arg 

85 90 95 



Ala Ser lie lie Val Met Gly Pro Leu Leu Ala Arg Asn Gly Tyr Ala 

100 105 110 



Lys Val Ala Leu Pro Gly Gly Cys Ala lie Gly Thr Arg Pro He Asp 
115 120 125 



Leu His Leu Lys Gly Phe Arg Ala Met Gly Val Asp Val Glu Val Glu 
130 135 140 



Gly Gly Tyr Val He Ala Thr Val Gin Asp Glu Leu Asp Gly Ala Asp 
145 150 155 1 160 



He Tyr Leu Asp Phe Pro Ser Val Gly Ala Thr Gin Asn He Leu Met 

165 170 175 



Ala Ala Thr Arg Ala Lys Gly Thr Thr Val He Glu Asn Ala Ala Arg 

180 185 190 



Glu Pro Glu He Val Asp Leu Ala Asn Tyr Leu Asn Lys Met Gly Ala 
195 200 205 



Arg He Tyr Gly Ala Gly Thr Asn Thr Met Arg He Glu Gly Val Asp 
210 215 220 



Lys Leu Glu Ala Cys Asp His Ser He He Ala Asp Arg He Glu Ser 
225 230 235 240 
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Gly Thr Phe Met Val Ala Ala Gly Val Thr Gin Gly Asn Val Leu lie 

245 250 255 



Glu Asp Cys He Val Glu His Asn Arg Pro Leu He Ser Lys Leu Ser 

260 265 270 



Glu Met Gly Val Gin Phe Glu Glu Glu Lys Thr Gly Leu Arg Val Met 
275 280 285 



Gly Pro Glu Thr Leu Gin Ala Thr Asp Val Lys Thr Leu Pro Tyr Pro 
290 295 300 



Gly Phe Pro Thr Asp Met Gin Ser Pro Met Thr Val Ala Gin Thr Leu" 
305 310 . 315 . 320 



Ala Glu Gly Arg Ser He Met Arg Glu Thr Val Phe Glu Asn Arg Phe 

325 330 335 



Met His Met Glu Glu Leu Arg Lys Met Asp Ala Gin Phe Thr Val Asp 

340 345 350 



Gly Gin Ser Leu He He Glu Gly Gly Lys Lys Leu Gin Gly Ala Arg 
355 360 365 



Val Gin Ser Ser Asp Leu Arg Ala Ser Ala Ser Leu He He Ala Gly 
370 375 380 



Leu Val Ala Asp Gly Val Thr Lys Val Thr Asn Leu Asn His Leu Asp 
385 390 395 400 



Arg Gly Tyr Tyr Lys Phe His Glu Lys Leu Gin Gin Leu Gly Ala Ser 

405 410 415 



He Glu Arg He Asp Glu Glu He Gin Val Asp Gin Glu Ala Ser Leu 

420 425 430 



Lys Lys Gly Glu 
435 



<210> 103 
<211> 1026 
<212> DNA 
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<213> Alloiococcus otitidis 

<220> 

<221> CDS 

<222> (34) . . (1026) 

<223> 

<400> 103 

acagttttaa tccagttagc cacaaggtgg gat atg atg gac tta gca gaa aaa 54 

Met Met Asp Leu Ala Glu Lys 
1 5 



caa gca ggg gtc tac caa ctt ttt gac cga ate ctg gec aac cat gec 
Gin Ala Gly Val Tyr Gin Leu Phe Asp Arg He Leu Ala Asn His Ala 
10 15 20 



102 



etc aag cat gec tat ctt ttt gaa ggt ttg gec gga tea ggc aaa ctg 150 
Leu Lys His Ala Tyr Leu Phe Glu Gly Leu Ala Gly Ser Gly Lys Leu 
25 30 35 

gag atg age egg tat att gee aag aga ctg ttt tgc ccc aac caa gac 198 
Glu Met Ser Arg Tyr He Ala Lys Arg Leu Phe Cys Pro Asn Gin Asp 
40 45 50 55 

cag gga caa get tgc caa gtt tgt ccc act tgc ttg cgc att gac cag 246 
Gin Gly Gin Ala Cys Gin Val Cys Pro Thr Cys Leu Arg He Asp Gin 

60 65 70 

ggt caa cac cct gat gtg gta gaa ata gee cct gag ggg aag gga egg 294 
Gly Gin His Pro Asp Val Val Glu He Ala Pro Glu Gly Lys Gly Arg 

75 80 85 

teg att agg gta gac egg gta cga cag gtc aag gat gee eta age aag 342 
Ser He Arg Val Asp Arg Val Arg Gin Val Lys Asp Ala Leu Ser Lys 
90 95 100 

tct ggt gtg gag agt caa aag aaa atg att ate ctt aac cag get gat 3 90 

Ser Gly Val Glu Ser Gin Lys Lys Met He He Leu Asn Gin Ala Asp 
105 110 115 

aaa atg acc ccc agt gca gee aac age ctg ctt aaa ttt ctg gaa gag . 438 

Lys Met Thr Pro Ser Ala Ala Asn Ser Leu Leu Lys Phe Leu Glu Glu 
120 125 130 135 

ccg gca ggg gat gtg act att ttc ttg tta gtt act age egg caa aac 486 
Pro Ala Gly Asp Val Thr He Phe Leu Leu Val Thr Ser Arg Gin Asn 

140 145 150 

ctt ttg cca act att gtt tec cgc tgc cag gtt ate cag ttt gee aag 534 
Leu Leu Pro Thr He Val Ser Arg Cys Gin Val He Gin Phe Ala Lys 

155 160 165 

cag gat tta aag act egg att gag gac tta gtg gaa gee ggt ttg tec 582 
Gin Asp Leu Lys Thr Arg He Glu Asp Leu Val Glu Ala Gly Leu Ser 
170 175 180 



cag gaa gaa gec cac ttg gec age cac etc age caa gac tta gac ttg 



630 
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Gin Glu Glu Ala His Leu Ala Ser His Leu Ser Gin Asp Leu Asp Leu 
185 190 195 

get aag tec etc att gag gaa gag gac ttg ctg gca gtc agt caa aaa 
Ala Lys Ser Leu lie Glu Glu Glu Asp Leu Leu Ala Val Ser Gin Lys 
200 205 210 215 

att tgg cag tgg ttt age tat etc atg aac caa gat gac ttg gec ttt 
lie Trp Gin Trp Phe Ser Tyr Leu Met Asn Gin Asp Asp Leu Ala Phe 

220 225 230 

ate eta gtc caa aga gac tta atg gee ttt ate caa gac egg gat gac 
lie Leu Val Gin Arg Asp Leu Met Ala Phe lie Gin Asp Arg Asp Asp 

235 240 245 

tgc cag atg gtt tgt gac tta ate etc tac etc ttc caa gac ctg etc 
Cys Gin Met Val Cys Asp Leu lie Leu Tyr Leu Phe Gin Asp Leu Leu 
250 255 260 

cac tta cac tac cat tta gat agt ccg gee tgc ttc gca ggc cac gaa 
His Leu His Tyr His Leu Asp Ser Pro Ala Cys Phe Ala Gly His Glu 
265 270 275 

agt gac etc cgc tac ttt atg gac ctg ctt teg ate aag caa gtg tct 
Ser Asp Leu Arg Tyr Phe Met Asp Leu Leu Ser lie Lys Gin Val Ser 
280 285 290 295 

tat gee atg caa gee acc ctg caa get aaa aga gaa gtg gac cac aat 
Tyr Ala Met Gin Ala Thr Leu Gin Ala Lys Arg Glu Val Asp His Asn 

300 305 310 

gtg gee agt cag get gtt tta gaa ggc ttg act ttg gac ttg cag gaa 
Val Ala Ser Gin Ala Val Leu Glu Gly Leu Thr Leu Asp Leu Gin Glu 

315 320 325 

agt ata ggc taa 
Ser He Gly 
330 



<210> 104 
<211> 330 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 104 

Met Met Asp Leu Ala Glu Lys Gin Ala Gly Val Tyr Gin Leu Phe Asp 
15 10 15 



Arg He Leu Ala Asn His Ala Leu Lys His Ala Tyr Leu Phe Glu Gly 

20 25 30 



Leu Ala Gly Ser Gly Lys Leu Glu Met Ser Arg Tyr He Ala Lys Arg 
35 40 ~ 45 
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Leu Phe Cys Pro Asn Gin Asp Gin Gly Gin Ala Cys Gin Val Cys Pro 
50 55 60 



Thr Cys Leu Arg He Asp Gin Gly Gin His Pro Asp Val Val Glu He 
65 ' 70 75 80 



Ala Pro Glu Gly Lys Gly Arg Ser He Arg Val Asp Arg Val Arg Gin 

85 90 95 



Val Lys Asp Ala Leu Ser Lys Ser Gly Val Glu Ser Gin Lys Lys Met 

100 105 110 



He He Leu Asn Gin Ala Asp Lys Met Thr Pro Ser Ala Ala Asn Ser 
115 120 125 



Leu Leu Lys Phe Leu Glu Glu Pro Ala Gly Asp Val Thr He Phe Leu 
130 135 140 



Leu Val Thr Ser Arg Gin Asn Leu Leu Pro Thr lie Val Ser Arg Cys 
145 150 155 160 



Gin Val He Gin Phe Ala Lys Gin Asp Leu Lys Thr Arg He Glu Asp 

165 170 175 



Leu Val Glu Ala Gly Leu Ser Gin Glu Glu Ala His Leu Ala Ser His 

180 185 190 



Leu Ser Gin Asp Leu Asp Leu Ala Lys Ser Leu He Glu Glu Glu Asp 
195 200 205 



Leu Leu Ala Val Ser Gin Lys He Trp Gin Trp Phe Ser Tyr Leu Met 
210 215 220 



Asn Gin Asp Asp Leu Ala Phe He Leu Val Gin Arg Asp Leu Met Ala 
225 ~ 230 235 240 



Phe He Gin Asp Arg Asp Asp Cys Gin Met Val Cys Asp Leu He Leu 

245 250 255 



Tyr Leu Phe Gin Asp Leu Leu His Leu His Tyr His Leu Asp Ser Pro 

260 265 270 
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Ala Cys Phe Ala Gly His Glu Ser Asp Leu Arg Tyr Phe Met Asp Leu 
275 280 285 



Leu Ser lie Lys Gin Val Ser Tyr Ala Met Gin Ala Thr Leu Gin Ala 
290 295 300 



Lys Arg Glu Val Asp His Asn Val Ala Ser Gin Ala Val Leu Glu Gly 
305 310 315 320 



Leu Thr Leu Asp Leu Gin Glu Ser lie Gly 

325 330 



<210> 105 
<211> 1785 
<212> DNA 

<213> Alloiococcus otitidis 

<220> 
<221> CDS 

<222> (13) . . (1785) 
<223> 

<400> 105 

gaggggagag ct atg acc cac cag gcc tta tac egg gta tgg cga ccg caa 51 

Met Thr His Gin Ala Leu Tyr Arg Val Trp Arg Pro Gin 
15 10 

agt ttt get gat gta tec ggc cag cat gtg gtc acc aag acc eta aag 99 
Ser Phe Ala Asp Val Ser Gly Gin His Val Val Thr Lys Thr Leu Lys 
15 20 25 

aat gcc att aaa aat gat aat acc agt cat gcc tac ctg ttt act gga 147 
Asn Ala lie Lys Asn Asp Asn Thr Ser His Ala Tyr Leu Phe Thr Gly 
30 35 40 45 

ccc egg ggg acg ggc aag acc agt gtg gca aaa ata ttt gcc aag gcc 195 
Pro Arg Gly Thr Gly Lys Thr Ser Val Ala Lys lie Phe Ala Lys Ala 

50 55 60 

att aat tgc ccc tac teg gat gat ggg gag cct tgt aat gaa tgt cag 243 
lie Asn Cys Pro Tyr Ser Asp Asp Gly Glu Pro Cys Asn Glu Cys Gin 

65 70 75 

att tgc cag gag ate acc cag ggt agt eta ggc gat gtc ate gaa ate 291 
lie Cys Gin Glu lie Thr Gin Gly Ser Leu Gly Asp Val lie Glu lie 
80 85 90 

gat gcg gcc age aat aat ggg gtg gaa gag att cgc gat att agg gaa 33 9 

Asp Ala Ala Ser Asn Asn Gly Val Glu Glu lie Arg Asp lie Arg Glu 
95 100 105 

aag get aat cat g-cc cca act teg gcc gtt tac aag gtc tac att ate 387 
Lys Ala Asn Tyr Ala Pro Thr Ser Ala Val Tyr Lys Val Tyr He He 
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110 115 120 125 

gat gag gtc cat atg tta tec tct ggg gec ttt aac gec etc ttg aaa 435 
Asp Glu Val His Met Leu Ser Ser Gly Ala Phe Asn Ala Leu Leu Lys 

130 135 140 

aca ctg gaa gag cct cca gec aat gtg gtc ttt ate tta gca acg act 483 
Thr Leu Glu Glu Pro Pro Ala Asn Val Val Phe lie Leu Ala Thr Thr 

145 150 155 

gaa ccc cac aag att ccg get acc att ate tec egg ace cag cgt ttt 531 
Glu Pro His Lys lie Pro Ala Thr lie lie Ser Arg Thr Gin Arg Phe 
160 165 170 

gat ttt aag egg att gac aac cag gac ate ate gac cgc ttg att tat 579 
Asp Phe Lys Arg lie Asp Asn Gin Asp lie He Asp Arg. Leu He Tyr 
175 180 185 

ate tta gaa gaa gac cag gtc ccc tac age aaa gaa gee gtc eta age 627 
He Leu Glu Glu Asp Gin Val Pro Tyr Ser Lys Glu Ala Val Leu Ser 
190 195 200 205 

eta gee aat gca gcg gaa ggt ggg atg egg gat gee ttg agt atg ttg 675 
Leu Ala Asn Ala Ala Glu Gly Gly Met Arg Asp Ala Leu Ser Met Leu 

210 : 215 220 

gac cag gee tta age ttt atg aca gat gag tta aca gaa gaa gtt gee 723 
Asp Gin Ala Leu Ser Phe Met Thr Asp Glu Leu Thr Glu Glu Val Ala 

225 230 235 

etc cag att aca ggg age att acc cag tct etc ttg ctt gaa tac ttg 771 
Leu Gin He Thr Gly Ser He Thr Gin Ser Leu Leu Leu Glu Tyr Leu 
240 245 250 

cag gtg att age caa ggt cag acg gaa gaa gga etc aag etc ttg caa 819 
Gin Val He Ser Gin Gly Gin Thr Glu Glu Gly Leu Lys Leu Leu Gin 
255 260 265 

gaa gtt tta ggg gaa ggc aag gac cct age egg ttt gtg gaa gac get 867 
Glu Val Leu Gly Glu Gly Lys Asp Pro Ser Arg Phe Val Glu Asp Ala 
270 275 280 285 

att atg atg acc egg gac etc ttg ctt tac caa act age caa ggc gat 915 
He Met Met Thr Arg Asp Leu Leu Leu Tyr Gin Thr Ser Gin Gly Asp 

290 295 300 

aat ttt gtt cct aaa ttg get cgc tta gac gac cag ttt gaa gac ctg 963 
Asn Phe Val Pro Lys Leu Ala Arg Leu Asp Asp Gin Phe Glu Asp Leu 

305 " 310 315 

gcg aag gac ttg gac aag gag atg gee tac cat att att gat gtc tta 1011 
Ala Lys Asp Leu Asp Lys Glu Met Ala Tyr His He He Asp Val Leu 
320 325 330 



aac caa acc caa gac gat etc cgc eta age aac cat ggg gaa gtc tat 1059 
Asn Gin Thr Gin Asp Asp Leu Arg Leu Ser Asn His Gly Glu Val Tyr 
335 340 345 
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ttg gaa ata gcc acg gtc aag ctt age cag cct tct tea gec gtt cag 
Leu Glu lie Ala Thr Val Lys Leu Ser Gin Pro Ser Ser Ala Val Gin 
350 355 360 365 



gag att gcc caa ctg caa aac cag gtc aag tec etc cag caa agt att 
Glu lie Ala Gin Leu Gin Asn Gin Val Lys Ser Leu Gin Gin Ser lie 

385 390 395 



tea aaa get ggc ccc aag caa tct ggc cct ggc aag tct aga age cac 
Ser Lys Ala Gly Pro Lys Gin Ser Gly Pro Gly Lys Ser Arg Ser His 
415 420 425 

cgt cac cag caa ggc ttc aag gtt aac egg aaa gcc gtt tac tct ate 
Arg His Gin Gin Gly Phe Lys Val Asn Arg Lys Ala Val Tyr Ser lie 
430 435 440 445 



cca gac ttg ate aat gtc ttg ace ate agt caa aag get ate tta aac 
Pro Asp Leu He Asn Val Leu Thr lie Ser Gin Lys Ala He Leu Asn 

465 47 0 475 



acg get ate ggc aat tac ate gaa aaa att ate ggc cgc cgt cca aga 
Thr Ala He Gly Asn Tyr He Glu Lys He He Gly Arg Arg Pro Arg 
510 515 520 525 



ate aag cag atg aaa aaa gaa gat ggc agt act aaa get ggc caa gca 
lie Lys Gin Met Lys Lys Glu Asp Gly Ser Thr Lys Ala Gly Gin Ala 

545 550 555 



1107 



acc ate cag gcc age caa gtc aac atg gtg gac cag gat aat aaa gaa 1155 
Thr He Gin Ala Ser Gin Val Asn Met Val Asp Gin Asp Asn Lys Glu 

370 375 380 



1203 



caa aac ttg caa get gga gcc aaa caa ggg cct aag caa aga get aag 1251 
Gin Asn Leu Gin Ala Gly Ala Lys Gin Gly Pro Lys Gin Arg Ala Lys 
400 405 410 



1299 



1347 



ttg gac cag gcg acc cgt aaa gac ctg gac gac etc caa gac etc tgg 1395 
Leu Asp Gin Ala Thr Arg Lys Asp Leu Asp Asp Leu Gin Asp Leu Trp 

450 455 460 



1443 



aat tec aaa cca gtt get get agt cca gag ggt ttg gtg gtg acc ttt' 1491 
Asn Ser Lys Pro Val Ala Ala Ser Pro Glu Gly Leu Val Val Thr Phe 
480 485 490 

gaa tat gat att eta tgt gag aga gca gag tct gac gag acc ttg caa 1539 
Glu Tyr Asp He Leu Cys Glu Arg Ala Glu Ser Asp Glu Thr Leu Gin 
495 500 " 505 



1587 



ctg gtc tgt gtg cct gaa gac aag tgg ccg act ate cgc cgc gat ttt 1635 
Leu Val Cys Val Pro Glu Asp Lys Trp Pro Thr He Arg Arg Asp Phe 

530 535 540 



1683 



agt gac ggc aag teg gat gat gac cca ggt caa gaa gac aac cag gcc 1731 
Ser Asp Gly Lys Ser Asp Asp Asp Pro Gly Gin Glu Asp Asn Gin Ala 
560 565 570 
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ctt aac aag get gtg gag ctt ttc ggt aaa gac aat att aca ate aaa 1779 
Leu Asn Lys Ala Val Glu Leu Phe Gly Lys Asp Asn lie Thr He Lys 
575 580 585 

gat taa 1785 

Asp 

590 



<210> 106 
<211> 590 
<212> PRT 

<213> Alloiococcus otitidis 
<400> 106 

Met Thr His Gin Ala Leu Tyr Arg Val Trp Arg Pro Gin Ser Phe Ala 
15 10 15 



Asp Val Ser Gly Gin His Val Val Thr Lys Thr Leu Lys Asn Ala lie 

20 25 30 



Lys Asn Asp Asn Thr Ser His Ala Tyr Leu Phe Thr Gly Pro Arg Gly 
35 40 45 



Thr Gly Lys Thr Ser Val Ala Lys He Phe Ala Lys Ala He Asn Cys 
50 55 60 



Pro Tyr Ser Asp Asp Gly Glu Pro Cys Asn Glu Cys Gin He Cys Gin 
65 70 75 80 



Glu He Thr Gin Gly Ser Leu Gly Asp Val He Glu He Asp Ala Ala 

85 90 95 



Ser Asn Asn Gly Val Glu Glu He Arg Asp He Arg Glu Lys Ala Asn 

100 105 110 



Tyr Ala Pro Thr Ser Ala Val Tyr Lys Val Tyr He He Asp Glu Val 
115 120 125 



His Met Leu Ser Ser Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 140 



Glu Pro Pro Ala Asn Val Val Phe He Leu Ala Thr Thr Glu Pro His 
145 150 155 160 



Lys Xle Pro Ala Thr He He Ser Arg Thr Gin Arg Phe Asp Phe Lys 

165 170 175 
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Arg He Asp Asn Gin Asp He He Asp Arg Leu He Tyr He Leu Glu 

180 185 190 



Glu Asp Gin Val Pro Tyr Ser Lys Glu Ala Val Leu Ser Leu Ala Asn 
195 200 205 



Ala Ala Glu Gly Gly Met Arg Asp Ala Leu Ser Met Leu Asp Gin Ala 
210 215 220 



Leu Ser Phe Met Thr Asp Glu Leu Thr Glu Glu Val Ala Leu Gin He 
225 230 235 240 



Thr Gly Ser He Thr Gin Ser Leu Leu Leu Glu Tyr Leu Gin Val He 

245 250 255 



Ser Gin Gly Gin Thr Glu Glu Gly Leu Lys Leu Leu Gin Glu Val Leu 

260 265 270 



Gly Glu Gly Lys Asp Pro Ser Arg Phe Val Glu Asp Ala He Met Met 
275 • 280 285 



Thr Arg Asp Leu Leu Leu Tyr Gin Thr Ser Gin Gly Asp Asn Phe Val 
290 295 300 



Pro Lys Leu Ala Arg Leu Asp Asp Gin Phe Glu Asp Leu Ala Lys Asp 
305 310 315 320 



Leu Asp Lys Glu Met Ala Tyr His He He Asp Val Leu Asn Gin Thr 

325 330 335 



Gin Asp Asp Leu Arg Leu Ser Asn His Gly Glu Val Tyr Leu Glu He 

340 345 350 



Ala Thr Val Lys Leu Ser Gin Pro Ser Ser Ala Val Gin Thr He Gin 
355 " 360 365 



Ala Ser Gin Val Asn Met Val Asp Gin Asp Asn Lys Glu Glu He Ala 
370 375 380 



Gin Leu Gin Asn Gin Val Lys Ser Leu Gin Gin Ser He Gin Asn Leu 
385 390 395 400 
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Gin Ala Gly Ala Lys Gin Gly Pro Lys Gin Arg Ala Lys Ser Lys Ala 

405 410 415 

Gly Pro Lys Gin Ser Gly Pro Gly Lys Ser Arg Ser His Arg His Gin 

420 425 430 



Gin Gly Phe Lys Val Asn Arg Lys Ala Val Tyr Ser lie Leu Asp Gin 
435 440 445 



Ala Thr Arg Lys Asp Leu Asp Asp Leu Gin Asp Leu Trp Pro Asp Leu 
450 455 460 



lie Asn Val Leu Thr He Ser Gin Lys Ala He Leu Asn Asn Ser Lys 
465 470 475 480 



Pro Val Ala Ala Ser Pro Glu Gly Leu Val Val Thr Phe Glu Tyr Asp 

485 490 495 

He Leu Cys Glu Arg Ala Glu Ser Asp Glu Thr Leu Gin Thr Ala He 

500 505 510 



Gly Asn Tyr lie Glu Lys He lie Gly Arg Arg Pro Arg Leu Val Cys 
515 520 525 



Val Pro Glu Asp Lys Trp Pro Thr lie Arg Arg Asp Phe He Lys Gin 
530 535 5 40 



Met Lys Lys Glu Asp Gly Ser Thr Lys Ala Gly Gin Ala Ser Asp Gly 
545 550 555 560 



Lys Ser Asp Asp Asp Pro Gly Gin Glu Asp Asn Gin Ala Leu Asn Lys 

565 570 575 



Ala Val Glu Leu Phe Gly Lys Asp Asn He Thr He Lys Asp 

580 585 590 



